AFIT/GSO/ENG/92D-05 


AD-A259  003 

■nniin 


Correlation  Based  Target  Location  and  Identification 


THESIS 

Richard  Alan  Wenzel 
Captain,  USAF 

AFIT/GSO/ENG/92D-05 


93-00146 

■llllllli 


Approved  for  public  release;  distribution  unlimited 


98  7 


1  i2,e 


AFIT/GSO/ENG/92D-05 


Correlation  Based 
Target  Location  and  Identification 


THESIS 


Presented  to  the  Faculty  of  the  School  of  Engineering 
of  the  Air  Force  Institute  of  Technology 
Air  University 
In  Partial  Fulfillment  of  the 


Requirements  for  the  Degree  of 
Master  of  Science  (Space  Operations) 


Richard  Alan  Wenzel,  B.S. 
Captain,  USAF 


December,  1992 


Aooesaion  For 

HTIS  GP.A&I 

13^ 

DTIC  TAB 

□ 

Unannounced 

□ 

Jur,  1 1/  i  c  at  1  on — 

By- 


I  Dl3t:^jLbutioia^ 


Availability  Codes 


Dial 


[Avail  and/or 
Speoial 


Approved  for  public  release;  distribution  unlimited 


Preface 


The  purpose  of  this  study  was  to  determine  the  viability  of  correlation  as 
an  autonomous  target  locator.  A  correlator  was  built  which  utilized  the  routines 
inherent  in  the  KHOROS  software  package  developed  by  the  University  of  New 
Mexico.  A  set  of  180  visible  light,  terrsun  board  images  were  correlated  before  and 
after  pre-processing  and  the  results  compared  against  the  known  target  location. 
A  set  of  rules,  both  quantitative  and  heuristic,  were  developed  as  the  basis  for  a 
rule-based  expert  system. 
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Correlation  Based 
Target  Location  and  Identification 


I.  Introduction 

With  the  current  Department  of  Defense  drawdown,  the  military  of  the  future 
will  have  to  provide  for  the  nation’s  security  with  fewer  people.  One  way  to  minimize 
the  effects  of  this  drawdown  is  by  developing  smart  weapons  that  can  autonomously 
sense,  identify,  track,  and  destroy  enemy  forces.  These  weapons  would  allow  fewer 
people  to  operate  more  equipment,  thus  increasing  the  offensive  capabilities  of  a 
smaller  military  force. 

Development  of  weapons  that  can  autonomovisly  find  and  destroy  enemy  forces 
is  an  extremely  complex  process  which  relies  on  research  in  many  areas  of  pattern 
recognition,  sensor  design,  and  computer  science.  For  a  weapon  to  locate  and  destroy 
a  target,  it  must  have  1)  sensors  capable  of  locating  possible  targets,  2)  an  automated 
target  recognizer  (ATR)  to  discriminate  targets  from  non-targets  or  background 
clutter,  and  provide  this  information  to  the  software  to  make  the  destroy  or  not 
destroy  decision,  and  3)  a  guidance  system  to  lock-on  and  track  the  target  until 
destruction.  The  first  and  third  steps  in  this  process  are  already  within  the  reach 
of  current  technology.  However,  it  is  the  second  step  that  remains  the  unsolved 
problem. 

The  Air  Force  has  spent  over  25  years  researching  and  developing  hardware 
and  software  in  the  attempt  to  design  an  ATR,  for  battlefield  situations.  However, 
due  to  the  complicated  nature  of  the  problem,  a  true  ATR  system  does  not  exist 
today. 
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1.1  Background 


There  axe  many  systems  today  that  provide  a  limited  solution  to  the  object 
recognition  problem.  Some  supermarkets  use  a  laser  beam  to  read  the  universal 
product  code  placed  on  consumer  goods,  ensuring  the  correct  price  is  charged  to  the 
customer.  Similarly,  some  secure  Air  Force  bases  use  scanners  that  read  the  unique 
pattern  of  blood  vessels  found  in  the  retina  for  personnel  identification.  While  these 
examples  work  well,  they  exist  in  a  controlled  environment.  The  object’s  orientation, 
position,  and  motion  are  constrained  to  within  certain  parameters  to  guarantee  quick 
processing.  In  fact,  it  behooves  the  individuals  involved  to  ensure  that  the  targets  are 
presented  to  the  system  correctly.  Otherwise,  the  checkout  clerk  at  the  supermarket 
will  incur  the  wrath  of  the  customer  for  being  too  slow,  and  the  person  seeking 
entry  to  the  base  will  be  temporarily  detained  in  the  retina  scanning  booth  until  the 
security  police  release  them. 

Even  with  the  many  years  of  research  in  this  area,  the  best  target  recognizer 
in  existence  remains  not  a  machine,  but  something  found  in  nature:  the  biological 
vision  system.  While  many  algorithms  have  been  developed  to  simulate  what  the 
eye  and  brain  do  naturally,  only  limited  successes  have  been  reported.  In  fact,  none 
of  these  successes  can  match  what  even  a  pigeon  is  capable  of  doing.  As  reported  by 
Rogers  (15:35),  B.  F.  Skinner,  a  famous  psychologist,  trained  pigeons  to  recognize 
whether  people  were  present  in  an  outdoor  scene.  According  to  Cerella  (5:431),  even 
though  pigeons  have  relatively  small  brains,  they  can  still  sort  slides  into  natural 
categories  such  as  people,  trees,  and  bodies  of  water,  a  capability  that  humans  do 
easily.  As  such,  pigeons  could  detect  people  in  a  scene  approximately  76  out  of  80 
times.  In  fact,  the  pigeons’  discrimination  abilities  were  found  to  be  good  enough 
that  they  would  be  able  to  ’^provide  all  the  guidance  necessary,  to  the  accuracy 
desired”,  to  deliver  munitions  on  target  (15:36). 

The  performance  of  any  ATR  system  is  dependent  on  good  sensor  data.  As¬ 
suming  good  sensor  data  is  available  to  the  ATR  system,  the  task  of  automated  target 
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recognition  is  classically  divided  into  three  steps.  First,  segmentation  labels  individ¬ 
ual  pixels  and  groups  sets  of  pixels  into  classes,  usually  possible  target  or  non-target. 
Second,  feature  extraction  computes  values  which  represent  the  distinguishing  char¬ 
acteristics  of  the  pixel  groups  found  by  segmentation.  Finally,  classification  assigns 
a  label,  usually  which  type  of  target,  to  the  pixel  groups  based  on  the  set  of  features 
found  during  feature  extraction.  However,  specific  techniques  for  ATR  may  “blur” 
this  three  step  process  into  a  different  set  of  steps. 

A  common  technique  used  for  target  location  and  identification  is  correlation 
(1:66)  (9:92).  In  correlation,  a  representative  image  of  a  target,  called  a  template^  is 
compared  to  the  image  of  interest  with  the  goal  of  finding  the  target  in  the  image. 
The  template  is  “overlaid”  onto  the  image  of  interest  at  every  possible  position.  For 
each  new  template  position,  the  correlation  between  the  template  and  the  image  is 
calculated.  The  correlation  function  is  defined  mathematically  as  (8:172): 

Rt0ix,  y)=  [  [  i{Q,  -x,p-  y)dadp  ( 1 ) 

where  i(a,  /?)  is  the  image  being  processed,  t(a  —  x,  /?  — y)  is  the  template  the  image  is 
correlated  with,  *  represents  the  complex  conjugation  operation,  and  Ri,  represents 
the  resulting  correlation.  This  correlation  represents  the  degree  of  similarity  between 
the  template  and  the  image  for  that  particular  pixel  in  the  image. 

In  a  perfect  world,  the  highest  correlation  calculated,  called  the  correlation 
peak,  represents  the  position  in  the  image  where  the  best  match  between  the  template 
and  the  image  occurred.  However,  in  a  non-perfect  world,  the  actual  target  location 
may  not  be  located  at  the  peak,  but  at  the  correlation  minimum,  or  at  a  secondary 
peak.  Correlating  an  image  with  various  templates  will  result  in  potential  targets 
found  for  each  template.  The  decision  then  becomes  which  template  best  matches 
the  image. 
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While  correlation  is  a  useful  tool,  it  does  have  some  limitations.  First,  it  is 
sensitive  to  scale  and  rotation  changes  (10:183).  A  template  represents  only  a  target 
of  a  particular  size  and  rotational  position.  Should  an  image  change  orientation  or 
size  due  to  movement  relative  to  the  sensor,  the  correlation  peak  will  degrade  from 
the  peak  correlation  value.  If  the  target  continues  to  move,  the  correlation  peak  may 
fade  below  the  threshold  and  escape  detection.  An  example  of  this  can  be  found 
in  research  conducted  by  Casasent  and  Psaltis  (4:1795).  In  their  experiments,  the 
signal-to- noise  ratio  decreased  fiom  30  dB  to  3  dB  with  only  a  two  percent  scale 
change  and  a  3.5  degree  rotation  change  of  the  image  being  processed.  The  strength 
of  the  signal  as  compared  to  the  noise  decreased  from  1000  times  greater,  to  only 
two  times  greater. 

One  way  to  overcome  these  problems  is  by  pre-processing  the  raw  image  data 
to  increase  the  chance  for  a  successful  correlation.  Another  possible  way  to  allevi¬ 
ate  these  problems  is  to  use  numerous  templates  that  represent  different  scale  and 
rotation  variations  for  each  possible  target. 

A  second  limitation  is  the  number  of  computations  needed  for  correlation.  For 
an  image  that  is  128  x  128  pixels  in  size,  16384  correlation  values  must  be  calculated, 
each  representing  one  of  the  16384  possible  positions  in  the  image.  Given  a  template 
that  is  32  x  32  pixels  in  size,  a  total  of  1024  pixel-to-pixel  calculations  occur  for  each 
of  the  16384  possible  positions.  In  addition,  a  system  will  normally  use  numerous 
templates  to  represent  the  targets  in  various  scales  and  orientations.  Compounding 
this  is  the  rate  at  which  the  image  will  be  updated  in  the  true  target  acquisition 
scenario.  If  the  sensor  stores  a  representation  of  the  scene  every  second,  the  number 
of  computations  necessary  quickly  becomes  prohibitive.  However,  there  is  a  way 
to  correlate  an  image  with  a  template  using  a  Fourier  transformation  (9:81).  The 
frequency  domain  representation  of  the  correlation  function  can  be  written  as: 
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(2) 

(3) 

(4) 


Rt,{x,y)  =  /  /  Hi,v)T^iC,v)exp\j2T{{^x  +  riy)](l^dT} 

y— oo  •/— 00 

=  J-’l/(u,t.)r(t.,i.)] 

where  ^  and  rj  are  the  spatial  frequency  coordinates,  /(^,  q)  is  the  image  in  the 
frequency  domain,  and  T*(4,  rj)  is  the  complex  conjugate  of  the  frequency  domain 
representation  of  the  template,  ^  is  the  Fourier  transformation  operation,  and 
is  the  inverse  Fourier  transformation  operation.  Each  image  and  template  is  trans¬ 
formed  from  the  space  domain  to  the  Fourier  domain,  resulting  in  both  a  real  and 
imaginary  number  for  each  pixel  in  both  the  template  and  the  image.  The  template 
is  further  processed  by  conjugation,  which  simply  reverses  the  sign  on  the  imaginary 
part  of  each  pixel.  The  image  and  template  are  then  multiplied  pixel  by  pixel  and 
inverse  Fourier  transformed  back  to  the  space  domain.  The  resulting  image  is  the 
2-D  correlation  plane  as  described  above,  but  was  found  with  far  fewer  computatioLs 
for  any  reasonably  sized  image. 

Another  limitation  of  correlation  is  its  sensitivity  to  background  clutter.  Where 
no  distribution  of  background  clutter  is  known  a  priori,  the  target  of  interest  can  be 
effectively  hidden  from  the  correlator.  In  fact,  a  pixel  with  a  large  intensity  value 
can  be  sufficient  to  skew  the  correlation  peak  completely  off  the  target  (1:67).  To 
overcome  this  limitation,  a  technique  called  energy  normalization  can  be  used  to 
maximize  the  target  energy  or  contrast  with  respect  to  the  background  clutter. 

This  research  will  focus  on  the  problem  of  locating  and  identifying  targets 
within  a  cluttered  background  using  correlation  via  Fast  Fourier  IVansforms  and 
energy  normalization.  No  assumption  will  be  made  as  to  the  amount  of  background 
clutter  resident  in  the  image. 
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1.2  Problem  Definition 


A  reliable  target  recognition  system  capable  of  locating  and  identifying  objects 
in  cluttered  images  does  not  exist  today.  This  system  must  process  sensor  data, 
reliably  identify  targets,  and  provide  appropriate  information  to  the  weapon  guidance 
system.  This  effort  will  investigate  correlation  to  determine  if  it  is  a  viable  technique 
for  locating  and  identifying  targets  in  visible  light  terrain  board  imagery. 

1.3  Materials  and  Equipment 

The  images  used  for  this  effort  are  photographs  of  terrain  boards.  A  terrain 
board  is  a  physical  scale  model  of  possible  hostile  targets  among  various  backgrounds 
and  occlusions.  It  is  made  up  of  smaller  scale  versions  of  objects  possibly  found  on 
a  battlefield,  such  as  bushes,  trees,  hills,  valleys  and  boulders,  as  well  as  hostile 
tanks,  trucks,  and  airplanes.  These  objects  are  arranged  in  various  configurations  to 
simulate  battlefield  topography  and  the  enemy’s  use  of  topography  for  camouflage 
or  cover  during  combat  situations.  The  images  are  in  the  visible  frequency  domain, 
that  is,  they  represent  black  and  white  images  discemable  by  the  human  eye. 

A  total  of  180  images  were  used  for  this  project.  All  images  are  visible  light 
terrain  board  images  of  Ml  tanks,  trucks,  and  airplanes  with  varying  amounts  of 
clutter,  ambient  light,  and  camera  elevation.  Specifically,  the  images  used  for  this 
project  consist  of  84  Ml  tank  images  with  various  aspect  angles  and  amounts  of 
background  clutter  (reference  Figure  1),  18  airplane  images  showing  airplanes  from 
above  (reference  Figure  2),  and  78  truck  images,  also  showing  the  trucks  from  above 
(reference  Figure  3). 

These  images  are  stored  in  the  HIPS  format  which  is  basically  a  header  ending 
with  a  period  (.)  followed  the  binary  gray  scale  representations  of  the  pixels  in  the 
image.  Each  pixel  is  encoded  with  a  brightness  intensity  on  the  scale  from  absolute 
black  (0)  to  absolute  white  (255).  The  image  consists  of  16384  pixel  values  which 
correspond  to  an  image  size  of  128  x  128  pixels.  A  more  detailed  description  of  the 
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Figure  1.  Sample  Tank  Image 


HIPS  format  can  be  found  in  Appendix  B  along  with  routines  to  perform  various 
image  format  conversions  found  in  Appendix  C. 

The  hardware  used  for  this  research  consist  of  the  SUN  SPARC  2  workstations 
and  the  Silicon  Graphics  workstations.  The  software  used  consists  of  both  “off-the- 
sheir’  and  application  specific  programs.  The  “off-the-shelP  software  was  KHOROS, 
an  integrated  image  and  signal  processing  software  environment  developed  by  the 
faculty  and  students  at  the  University  of  New  Mexico(14).  KHOROS  is  basically 
a  large  library  of  image  and  signal  processing  routines  that  can  be  used  singly  or 
in  groups  to  perform  such  functions  as  transforms  and  data  conversions.  The  ap¬ 
plication  specific  programs  consist  of  C  programs  used  for  image  pre-processing,  file 
manipulations,  and  data  conversions  written  specifically  for  this  effort. 

1.4  Scope  and  Limitations 

This  research  concentrates  only  on  the  problem  of  target  location  and  identi¬ 
fication  in  cluttered  images.  Specifically,  this  effort  will  evaluate  the  effectiveness  of 
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Figure  2.  Sample  Plane  Image 

correlation  as  a  target  locator  and  identifier  in  visible  light  imagery.  The  following 
limitations  are  imposed  on  this  eflfort: 


•  Detecting  the  presence  of  a  target  will  not  be  addressed.  All  images  are  as¬ 
sumed  to  have  a  target  present.  Previous  AFIT  research  by  Mayo  (13),  TYoxel 
(20),  and  Walrond  and  Childress(21)  have  addressed  target  detection. 

•  The  optical  implementation  and  testing  of  the  research  methods  will  not  be 
conducted.  These  are  subjects  worthy  of  their  own  research  efforts. 

•  Multiple  target  recognition  will  not  be  addressed.  Each  image  is  assumed  to 
have  only  one  target  present. 


1.5  Thesis  Overview 

This  thesis  is  organized  into  five  chapters  with  seven  appendices.  The  material 
in  the  following  chapters  is  grouped  logically  by  types  of  images  processed;  raw 
images  and  energy  normalized  images.  The  appendices  contain  all  software  developed 
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Figure  3.  Sample  TVuck  Image 


in  support  of  this  effort  as  well  as  background  information  provided  to  help  readers 
with  less  experience  in  correlation  image  processing. 

This  chapter  is  an  introduction  to  the  thesis  topic,  its  scope,  and  materials  and 
equipment  used.  Chapter  II  contains  a  literature  review  showing  how  similar  prob¬ 
lems  have  been  studied  in  the  past.  Chapter  III  discusses  the  research  methodology 
and  tests  conducted  to  verify  the  integrity  of  various  software  routines  developed. 
Chapter  IV  reports  the  results  of  the  research.  Chapter  V  presents  conclusions 
reached  through  this  effort. 
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II.  Literature  Review 


This  chapter  presents  past  research  efforts  that  attempted  to  solve  similar  ATR 
problems. 

2.1  Daugman  Research 

Daugman  (7)  cites  research  on  the  mammalian  visual  nervous  system  (retina, 
lateral  geniculate,  and  primary  visual  cortex)  as  motivation  for  using  Gabor  filters  to 
transform  input  images  and  achieve  better  segmentation  results.  He  proves  that  2-D 
(two-dimensional)  Gabor  filters  can  “capture  critical  neurobiological  variables  of  a 
given  neuron’s  orientation  and  spatial  frequency  preference,  the  tuning  bandwidths 
for  these  variables,  the  receptive  field  dimensions  and  the  relationships  among  all  of 
these  parameters”.  In  fact,  97  percent  of  mammalian  cortical  simple  cells  have  2-D 
receptive  field  profiles  which  can  be  well  fit  by  2-D  Gabor  elementary  functions. 

Daugman  used  a  three  layered  artificial  neural  network  to  find  better  Gabor 
filters.  The  network  had  fixed  weights  on  the  first  and  third  layers  based  on  the  2-D 
Gabor  elementary  functions  and  the  second  layer  having  adjustable  weights.  The 
input  image  was  composed  of  various  textures  of  anisotropically  filtered  white  noise 
fields  and  was  successfully  segmented  into  its  various  texture  regions. 

2.2  Hadett  Research 

Hazlett  (11)  investigated  segmenting  high  resolution  synthetic  aperture  radar 
(SAR)  images  using  both  Gabor  filters  and  radial  basis  functions.  Each  2048  x 
2048  pixel  SAR  image  was  normalized  via  a  fast  Fourier  transform  to  minimize  the 
differences  due  to  various  image  sources. 

The  initial  set  of  Gabor  filters  were  those  suggested  by  Daugman.  From  this 
starting  point,  complete  Gabor  filter  sets  for  various  window  sizes  were  generated. 
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The  optimal  set  of  filters  were  chosen  using  two  methods.  First,  the  power  spectrum 
was  computed  and  the  two  highest  peaks  were  chosen.  Second,  specific  windows  of 
the  input  image  with  known  content,  such  as  trees,  grass,and  shadows,  were  pro¬ 
cessed  and  the  three  filters  having  the  greatest  magnitude  coeflScients  were  selected. 
The  filters  thus  selected  were  then  combined  and  the  three  filters  with  the  greatest 
occurrences  were  selected  as  the  final  set. 

Hazlett  then  used  the  Gabor  filter  coeflScients  as  training  vectors  into  a  radial 
basis  function  neural  network.  The  number  of  clusters  used  to  build  the  neural  net¬ 
work  was  varied  so  that  an  optimum  number  could  be  found.  From  this,  he  achieved 
an  overall  accuracy  of  81  percent.  However,  there  was  some  shortcomings  to  this 
process.  The  transition  regions  between  trees  and  shadows  were  often  misclassified 
as  grassy  which  limits  the  overall  accuracy  of  the  process. 

2.3  Ruck  Research 

Ruck  (17)  studied  ways  to  segment  an  image  using  a  multi-  function  laser 
radar  sensor  using  both  doppler  and  relative  range  images.  He  used  two  techniques 
to  segment  the  different  types  of  images,  optimum  thresholding,  and  the  technique 
developed  by  C.  Tong  in  his  1986  thesis  Target  Segmentation  and  Image  Enhance¬ 
ment  through  Multisensor  Data  Fusion  (19). 

To  segment  the  doppler  images,  he  used  three  different  threshold  based  ap¬ 
proaches  which  all  were  ultimately  binary  optimum  thresholding  techniques.  The 
optimum  threshold  approach  involves  the  idea  that  there  exist  only  two  tsqje  of  pix¬ 
els  in  the  image,  in  this  case,  target  and  nontarget.  Any  pixel  with  a  value  higher 
than  the  threshold  value  is  considered  a  target  pixel  and  any  below  the  threshold  is 
considered  a  nontarget  pixel. 

However,  for  segmentation  of  the  relative  range  images,  he  used  the  algorithm 
developed  by  C.  Tong.  This  algorithm  assumed  that  the  range  gradient  (the  differ¬ 
ence  between  maximum  and  minimum  values  of  the  range)  will  be  relatively  low  for 
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targets  while  the  range  gradient  will  be  relatively  high  for  nontarget  (background) 
images. 

From  the  two  segmentation  techniques,  Ruck  was  able  to  create  a  list  of  all 
the  different  regions  found  in  the  input  images.  He  did  this  using  two  different 
classification  decision  rules: 

•  a  statistical  nearest  neighbor  approach 

•  a  biologically-based  neural  network  multilayer  perceptron 

Ruck’s  conclusion  is  that  the  multilayer  perceptron  performed  is  statistically 
equivalent  to  the  nearest  neighbor  classifier. 

2.4  Roggemann  Research 

Roggemann  (16)  studied  Forward-Looking  Infrared  (FLIR)  sensor  images  to 
develop  a  segmentation  algorithm.  He  based  his  algorithm  on  the  following  observar 
tions: 

•  the  targets  usually  have  higher  temperatures  than  the  background 

•  the  targets  usually  were  differentially  heated  due  to  operation  and  sun  warming 

•  the  targets  occupied  a  small  fraction  of  the  total  pixels  found  in  the  image 

With  these  assumptions,  an  optimum  threshold  approach  was  used.  Since  FLIR 
images  are  a  measure  of  the  apparent  relative  temperature  of  the  pixels,  and  targets 
are  considered  hotter  than  the  background,  any  pixel  higher  than  a  threshold  value 
is  considered  a  target  pixel  and  any  lower  than  that  value  is  considered  a  background 
pixel.  However,  selecting  a  threshold  value  is  a  problem.  With  two  types  of  regions 
resident  in  the  image,  most  expect  a  bi-modal  distribution  when  the  pixel  intensities 
are  placed  in  a  histogram  and  a  good  threshold  value  would  be  the  mean  of  the  two 
median  values  (9:354).  In  Roggemann ’s  imagery,  the  target  pixels  are  relatively  rare 
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as  compared  to  the  background  pixels.  This  resulted  in  the  histogram  representation 
of  pixel  values  not  showing  the  expected  bi-modal  distribution. 

To  overcome  this  problem,  Roggemann  developed  an  adaptive  algorithm  to 
choose  the  appropriate  threshold  value  and  thus  segment  the  image.  He  then  used  a 
set  of  heuristic  operations  to  reject  various  pixels  that  were  characterized  as  targets 
but  were  deemed  not  targets.  Results  of  this  ATR  system  were  found  to  be  useful, 
but  not  excellent.  Roggemann  cites  the  need  for  development  of  better  heuristic 
segmentation  operations. 

2.5  Smiley  Research 

Smiley  (18)  showed  that  cluttered  SAR  imagery  could  be  segmented  using 
parts  of  a  multiresolution  representation  produced  by  dilations  and  translations  of  a 
wavelet  transform.  This  process  involved  three  steps: 

•  generating  the  multiresolution  representation  of  the  image 

•  extracting  the  features  from  that  representation 

•  segment  the  multiresolution  representation  using  a  radial  basis  function  arti¬ 
ficial  neural  network  into  different  homogeneous  regions  such  as  trees,  fields, 
and  shadows 

Each  image  is  transformed  using  an  affine  wavelet,  which  can  be  thought  of  as 
filtering  the  image  through  a  progression  of  low-pass  filters  with  decreasing  band- 
widths.  This  results  in  a  set  of  representations  of  the  original  image,  each  with 
successively  lower  resolution.  From  here,  an  optimum  level  of  representation  is  cho¬ 
sen.  This  chosen  level  has  associated  with  it  certain  values  which  represent  the 
frequency  content  of  the  image.  These  values  are  then  used  as  features  and  fed  into 
a  radial  basis  function  artificial  neural  network  as  a  way  to  train  the  network.  Im¬ 
ages  to  test  the  network  were  similarly  transformed  and  fed  into  the  neural  network. 
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Smiley’s  results  show  that  the  neural  network  could  segment  both  natural  regions  as 
well  as  man-made  objects  from  the  surrounding  region. 

2.6  Law  Research 

Law  (12)  investigated  object  tracking  using  adaptive  correlation  techniques. 
Forward  Looking  Infrared  Radar  (FLIR)  images  of  various  targets  were  collected  at 
one  second  intervals  from  a  DC-3  aircraft  as  it  flew  from  a  range  of  roughly  10  kilo¬ 
meters  until  it  was  directly  above  the  targets.  These  images  were  then  digitized  for 
processing  using  adaptive  correlation.  Each  image  processed  by  the  adaptive  corre¬ 
lator  became  the  template  for  the  subsequent  image  processing.  In  other  words,  the 
correlator  “adapted”  to  the  changing  scene  by  updating  the  template  each  processing 
cycle.  This  prevented  the  correlation  peak  from  fading  as  the  target  deviated  from 
the  scale  and/or  rotation  represented  in  the  image. 

Law’s  results  showed  adaptive  correlation  to  be  a  viable  technique  for  track¬ 
ing  infrared  targets  if  two  conditions  were  met.  First,  the  time  difference  between 
successive  frames  must  be  small,  representing  a  high  scan  rate,  and  the  processing 
cycle  of  the  correlator  must  be  as  fast,  or  faster,  than  the  image  scan  rate.  Second, 
an  image  enhancement  technique  must  be  used  (12:95-96). 

2.7  Booz,  Allen,  &  Hamilton  Research 

Previous  work  by  Booz,  Allen,  &  Hamilton  for  Wright  Laboratories  Advanced 
Systems  Research  Group  (WL/AAAT-1)  was  successful  in  segmenting  visible  light, 
terrain  board  imagery  (2).  Their  approach  was  to  have  an  operator  hand  segment 
representative  images  to  extract  the  target  of  interest  for  use  as  templates.  These 
templates  were  then  used  to  extract  11x11  sub-images,  called  features,  which  became 
the  templates  used  for  the  correlation  step.  The  relative  spatial  locations  of  the 
features  in  the  original  image  were  also  noted  for  use  in  the  performance  (object 
location)  phase.  The  features  and  spatial  locations  were  then  fed  into  the  Artificial 
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Neural  Vision  Learning  System  {ANVIL)  which  used  a  neural  network  architecture 
for  the  target  identification  process. 

This  work  was  successful  in  that  it  proved  to  be  robust  to  scale  and  perspective 
changes,  provided  that  it  was  trained  with  imagery  that  had  a  variety  of  scales  and 
perspectives  represented.  The  amount  of  robustness  was  found  to  be  ±  10  degrees 
in  azimuth,  ±  5  degrees  in  elevation,  and  ±  25  percent  in  scale(2:2). 

2.8  Summary 

Overall,  there  are  many  techniques  that  attempt  to  simulate  what  the  mam¬ 
malian  brain  does  naturally.  Each  technique  contains  inherent  strengths  to  be  maxi¬ 
mized  and  limitations  to  be  overcome.  Investigating  all  of  them  would  require  much 
more  time  than  allowed  for  this  effort.  As  such,  only  one  technique  will  be  studied: 
correlation.  A  description  of  the  correlator  and  its  implementation  as  well  as  the 
research  methodology  follow  in  the  next  chapter. 


15 


III.  Research  Methodology 


The  correlator  used  on  the  raw  images  for  this  effort  was  implemented  by 
routines  inherent  in  the  KHOROS  software  and  is  similar  to  the  correlator  developed 
by  Law  (12).  A  description  of  the  routines  used  is  located  in  Appendix  E.2. 

Once  the  correlator  was  built,  the  next  step  was  to  verify  the  integrity  of  the 
correlator  by  a  series  of  tests.  A  128  x  128  pixel  test  image  was  created  which  is 
composed  of  a  completely  white  square  target  (pixel  values  of  255.0)  on  a  black  (pixel 
values  of  0.0)  background.  The  test  image  also  contains  some  white  speckle  and  a 
10x10  pixel  region  of  high  intensity  pixels.  The  target  has  an  upper  left  coordinate 
of  (83,42)  and  has  a  width  and  height  of  30  pixels.  The  known  center  of  the  target 
is  at  pixel  location  (98,57).  The  test  image  is  shown  in  Figure  4. 


Figure  4.  Correlator  Test  Image 


The  purpose  of  the  tests  was  to  verify  the  operation  of  the  correlator  and  its 
sensitivity  to  noise  and  intensity  variations.  Results  of  the  test  are  included  in  the 
next  chapter. 
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The  following  steps  compose  the  research  methodology: 


•  Truth  data  -  Each  test  image  was  hand-tnithed.  The  smallest  box  that  com¬ 
pletely  surrounds  the  target,  without  touching  the  target,  was  noted  and  its 
centroid  calculated  by  the  following  equation: 


.  width 

^mid  —  ^min  d"  T  ) 


.  ,  height . 

Umid  ~  ymin  d"  tTlty  ^  ) 

where  Xmid  and  ymid  represent  the  centroid  of  the  target,  and  ymin  repre¬ 
sent  the  “upper  left”  coordinate  values  used  to  truth  the  image  using  KHOROS 
(by  KHOROS  convention),  width  is  the  target  width  in  pixels,  and  height  is 
the  target  heights  in  pixels.  These  four  values,  Xmint  ymm»  tvidth,  and  height 
completely  define  the  box  drawn  around  the  target  and  form  the  basis  for  all 
correlator  experiments.  A  list  of  the  image  names  and  the  truthing  data  used 
for  correlation  is  found  in  Appendix  D. 

•  Image  pre-processing  -  Each  image  was  pre-processed  by  energy  normalization 
and  local  energy  normalization  to  enhance  the  effectiveness  of  correlation  as  a 
target  recognizer. 

•  Pre-processed  Image  Correlation  -  These  pre-processed  images  were  then  corre¬ 
lated  with  the  known  target  inherent  to  that  image.  The  resulting  correlation 
peak  and  correlation  minimum  were  noted.  The  images  were  then  correlated 
with  various  templates  of  similar  targets  with  the  correlation  peaks  noted.  Fi¬ 
nally,  the  entire  energy  normalized  image  set  was  correlated  across  all  templates 
and  the  peaks  noted. 

•  Raw  Image  Correlation  -  Each  test  image  was  correlated  with  a  template  made 
from  the  target  located  in  that  image  surrounded  by  minimum  value  pixels. 
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The  resulting  correlation  peak,  correlation  minimum,  and  secondary  peaks  were 
noted  as  possible  target  locations. 

•  Comparison  -  The  correlation  peak  and/or  correlation  minimum  found  from 
the  various  correlation  steps  were  compared  with  the  known  target  centroid 
and  the  distance  between  the  two  calculated  by  the  following  equation: 

~  "b  {.ymid  J/c)^ 

where  Xc  and  Pe  represent  the  position  of  the  correlation  peak  or  minimum, 
and  D  represents  the  distance  in  pixels. 

•  Heuristic  target  location  -  Any  image  that  failed  both  the  correlation  peak 
and  correlation  minimum  criteria  had  its  3-D  correlation  plane  examined  for  a 
secondary  peak.  When  this  failed,  the  2-D  correlation  plane  was  also  examined 
in  an  attempt  to  identify  a  heuristic  rule  or  rules  for  locating  the  target. 

•  Success/Failure  decision  -  The  distance  D  calculated  was  used  to  determine 
success  or  failure.  Any  distance  greater  than  one  half  the  diagonal  of  the 
box  encircling  the  target  was  considered  a  failure.  Also,  any  image  failing 
the  distance  criteria  was  examined  for  success/failure  using  the  heuristic  rules 
developed.  When  the  entire  image  set  was  correlated  with  the  entire  template 
set,  the  best  correlation  peak  noted  was  also  examined  as  to  which  type  of 
target  had  been  identified,  with  an  incorrect  target  identification  resulting  in 
a  failed  correlation. 

3. 1  Summary 

This  chapter  reported  the  correlator  and  research  methodology  used  for  this  ef¬ 
fort.  The  next  chapter  reports  results  from  the  correlator  operations  checks,  followed 
by  the  actual  correlation  results  and  success  rates. 
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IV.  Findings 


This  chapter  reports  the  results  achieved  in  this  effort.  The  first  section  reports 
the  operational  check  of  the  correlator.  The  second  section  reports  the  sensitivity 
of  the  correlator  to  low  variability  pixel  images.  The  final  section  reports  the  actual 
correlation  results  achieved  on  the  set  of  images. 

4.1  Correlator  Operation  Check 

The  first  test  checks  for  proper  correlator  operation.  The  target  (reference 
Figure  5)  was  extracted  and  centered  in  the  template,  with  a  background  of  zero 
pixel  values  (Reference  Figure  6).  The  test  image  was  then  correlated  with  the 
template  and  the  correlation  peak  noted  (Reference  Figure  7  and  Figure  8).  The 
statistics  from  the  test  image  correlation  (reference  Figure  9)  show  the  correlation 
peak  was  found  at  pixel  location  (98,57)  as  expected.  An  additional  check  can 
be  made  with  simple  mathematics.  Correlating  a  30  x  30  square  of  255.0  pixel 
values  with  a  30  x  30  square  of  255.0  pixel  values  resulted  in  a  correlation  peak  of 
30^  X  255^  =  5.85225x10^.  This  also  matches  the  magnitude  of  the  correlation  peak 
found  in  Figure  9. 

4.2  Correlator  Sensitivity  Check 

The  next  test  checked  the  sensitivity  of  the  correlator.  The  test  image  is 
identical  to  the  previous  test  image,  except  that  the  background  pixels  are  set  to 
254.0  rather  than  0.0.  If  the  correlator  is  working  correctly,  then  the  correlation  peak 
will  also  be  found  at  pixel  location  (98,57)  as  noted  above.  Referencing  Figure  10 
and  Figure  11,  the  correlation  peak  was  found  where  it  should  be. 
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Figure  5.  Correlator  Test  Image 


Figure  6.  Correlator  Test  Image  Template 
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Figure  7.  2-D  Test  Image  Correlation  Plane 


Figure  8.  3-D  Test  Image  Correlation  Plane 
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'Imb*  Statistics  for  File  NaM:  /tap/vctorKmalTTlS 


Statistics  for  Band  *0 


Hewi:  3.6S766e*06 
Variance:  7.233S2e»13 
Std.  Oev:  8.S05Me*<l6 
RHS:  9.25793e*C6 

Peaks: 

High:  5.8S22Se*07  at  (X,Y>  location:  (98,57) 

Loy  :  -4  at  (X.Y)  location:  <43,80) 

Total  Integral  under  the  Inage:  5.9927e«10  _ 

Positive  part  of  Integral  wdar  the  Inage:  5.99270*10 

Negative  part  of  Integral  isidar  the  inage:  -260.386 

Contributing  points:  16364 

Nunber  of  positive  points  In  Inage:  15914 

Nunber  of  negative  points  In  Inage:  470 

Skeuness:  3.13534 

Kurtosis:  10.2604 


Figure  9.  Test  Image  Correlation  Statistics 


4.3  Energy  Normalization  Check 

The  final  test  checks  for  proper  energy  normalization  of  the  images.  Each 
image  was  normalized  using  the  enorm.c  program  found  in  Appendix  D.8  and  stored 
in  a  new  file.  This  new  file  was  then  used  as  the  input  into  the  program  checkjiorm.c 
found  in  Appendix  D.ll.  This  program  simply  read  in  the  energy  normalized  file 
one  pixel  at  a  time,  squared  the  pixel  value,  and  then  added  this  value  to  a  running 
sum.  When  completed,  the  running  sum  should  be  equal  to  the  total  energy  in  the 
image,  which  should  be  unity,  within  the  limitation  of  round  off  error  inherent  to  the 
computer  being  used.  Running  checkjnorm.c  on  Ml-e0n2wl28s2.asc  after  energy 
normalization  resulted  in  a  total  energy  value  of  .999974. 

A  second  test  of  energy  normalization  was  also  run  using  the  correlator.  A  test 
image  consisting  of  a  solid  white  square  on  a  black  background  was  generated  and 
energy  normalized.  This  image  was  then  run  through  the  correlator  and  the  corre¬ 
lation  peak  found.  If  the  energy  normalization  worked  as  expected,  the  correlation 
peak  should  be  unity.  A  test  ima^e  whose  known  center  is  at  pixel  location  (57,48) 
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Figure  10.  3-D  Test  Image  Correlation  Plane 
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Figure  11.  Sensitivity  Test  Image  Statistics 
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was  energy  normalized  and  correlated.  Referencing  Figure  12,  the  correlation  peak 
is  1.00003  located  at  location  (57,48)  as  expected. 
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Figure  12.  Energy  Normalization  Test  Statistics 


4.4  Image  Correlation 

The  various  versions  of  the  images,  both  energy  normalized  and  raw,  were 
correlated  using  the  C  program  correlate.c  located  in  Appendix  D.5.  Correlate.c 
is  simply  a  program  that  executes  the  C  Shell  program  correlate  for  each  image. 
This  C  Shell,  located  in  Appendix  D.6,  executed  the  appropriate  KHOROS  routines 
to  find  the  correlation  of  the  image  with  a  cutout  of  the  target  from  that  image, 
using  the  fast  Fourier  transform  routine  inherent  to  KHOROS.  The  local  energy 
normalization  of  a  cutout  target  with  its  mother  image  was  accomplished  using  the 
C  program  localnorm.c  found  in  Appendix  D.15.  Standard  energy  normalization 
was  accomplished  with  the  C  program  enorm.c  and  the  C  shell  doenorm  found  in 
Appendices  D.8  and  D.9,  respectively.  The  results  from  the  various  correlations  are 
reported  next. 

4.4-1  Energy  Normalized  Images  To  energy  normalize  an  image,  each  pixel 
in  the  image  is  normalized  to  a  value  between  0  and  1  by  dividing  by  the  total  energy 
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in  the  image,  where  total  energy  is  calculated  by: 


Erotcd  —  /EE 

where  I(x,y)  is  the  intensity  of  the  pixel  at  location  (x,y)  in  the  image.  Energy 
normalization  is  designed  to  decrease  the  impact  made  a  spurious  pixel  or  group 
of  pixels  with  high  intensity  gray  scale  values  by  incorporating  more  of  the  target 
shape  into  the  correlation.  It  decreases  these  spikes  more  than  the  lower  values.  The 
hope  is  to  de-emphasize  the  spurious  peaks  to  the  point  that  the  true  shape  of  the 
target  determines  the  correlation  peak. 

The  target  located  in  each  image  was  extracted  using  the  data  found  in  Ap¬ 
pendix  E  and  placed  in  the  center  of  a  template  surrounded  by  minimum  pixel  values 
via  the  program  make-template. c  found  in  Appendix  D.12.  Once  the  template  was 
made,  both  the  original  image  and  the  template  were  energy  normalized  using  the 
C  program  enorm.c  found  in  Appendix  D.8  as  called  by  the  C  Shell  doenorm  found 
in  Appendix  D.9.  Once  the  image  and  template  were  energy  normalized,  they  were 
correlated  using  the  C  Shell  new-correlate  found  in  Appendix  D.13  as  it  was  called 
by  the  C  program  do-new-correlate.c  found  in  Appendix  D.14.  The  resulting  correla¬ 
tion  statistics  were  examined  and  the  location  of  the  correlation  peak  and  correlation 
minimum  noted. 

The  results  showed  limited  success.  Of  the  84  Ml  tank  images,  three  found  the 
known  target  center  as  the  correlation  peak  (four  percent).  Of  the  18  airplane  on 
runway  images,  all  failed  to  find  the  known  target  center  (0  percent).  Finally,  out  of 
the  78  truck  images,  30  were  able  to  find  the  known  target  center  (38  percent).  This 
resulted  in  an  overall  18  percent  success  rate  for  finding  targets  in  raw  images.  For 
the  first  cut,  this  implies  that  this  correlation  is  technique  is  not  a  viable  alternative 
for  this  problem. 
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Analysis  shows  why  this  is  true.  The  locations  of  the  correlation  peaks  found 
are  located  in  areas  of  “medium”  pixel  values.  Since  the  templates  are  created  with 
a  square  encompassing  the  target,  some  background  pixels  become  part  of  the  tem¬ 
plate.  These  background  pixels  are  sufficient  to  skew  the  correlation.  Heuristically, 
the  template  is  made  up  of  a  mixture  of  light  and  dark  pixel  values,  which  have  a  me¬ 
dian  value.  Correlating  the  template  with  a  region  of  nearly  homogenous  pixel  values 
which  are  at  or  near  the  median  value  of  the  template  will  result  in  the  correlation 
peak  being  found  there. 

However,  an  interesting  occurrence  was  noted  when  viewing  some  of  the  2-D 
correlation  planes.  In  some  of  the  2-D  correlation  planes,  there  is  a  region  of  low 
pixel  values  approximately  the  size  of  the  target.  Reference  Figure  13  and  Figure  14. 
Heuristically,  this  lends  credibility  to  the  technique  of  finding  the  location  of  the 
minimum  correlation  value  vice  the  correlation  peak.  Using  this  new  technique,  the 
images  are  again  correlated. 

Using  the  correlation  minimum,  24  of  84  tank  images  (29  percent),  none  of  the 
airplane  images  (0  percent),  and  28  of  78  truck  images  (36  percent)  were  located 
for  a  success  rate  of  52  out  of  180  (29  percent).  Combining  the  two  techniques  and 
accounting  for  the  two  tank  image  and  eight  truck  image  overlap,  gives  an  overall 
success  rate  of  25  of  84  tank  images  (30  percent),  none  of  the  airplane  images  (0 
percent),  and  50  of  78  truck  images  (64  percent)  located  for  a  total  success  rate  of 
75  out  of  180  (42  percent).  A  successful  image  correlation  is  shown  in  Figures  15 
and  16.  An  unsuccessful  image  correlation  is  shown  in  Figures  17  and  18. 

A  second  energy  normalization  technique,  called  local  energy  normalization 
was  then  used.  Each  image  was  read  in  and  its  target  extracted  using  the  truth 
data.  The  target  pixels  were  energy  normalized  and  stored  in  the  template.  The 
template  was  then  overlayed  at  all  possible  positions  in  the  original  image,  including 
the  edges  by  using  wrap  around.  At  each  position,  the  pixels  under  the  template 
were  locally  energy  normalized  and  stored.  The  energy  normalized  image  pixels  were 
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Figure  13.  Original  Image 


Figure  14.  Local  minimum  Correlation  Plane.  Notice  the  large  group  of  black  pixels 
at  and  around  the  center  of  the  known  position  of  the  tank. 
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Figure  15.  Original  Image 


Figure  16.  Raw  Image  2-D  Correlation  Plane.  The  cross-hairs  indicate  the  location 
of  the  target  found  by  correlation. 


Figure  17.  Original  Image 


Figure  18.  Raw  Image  2-D  Correlation  Plane.  The  left-most  cross-hairs  indicate 
the  location  of  the  actual  target  and  the  right-most  cross-hairs  indicate 
the  target  location  found  by  the  correlation. 
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then  multiplied  point-by-point  by  the  template  pixels  and  summed.  The  total  sum 
became  the  correlation  value  at  that  position  in  the  ima..ie.  This  algorithm  was 
implemented  by  the  C  program  loco/norm.c  found  in  Appendix  D.15. 

The  results  from  local  energy  normalization  were  encouraging.  All  of  the  180 
images  found  the  known  target  center.  Moreover,  the  correlation  peak  was  found  to 
be  unity  in  each  case,  as  was  expected.  Examples  of  an  image  and  its  2-D  c'n  relaiion 
plane  are  shown  in  Figure  19  and  Figure  20.  This  technique  is  analogous  to  assuming 
a  perfect  set  of  templates  available  to  the  correlator  for  target  location.  The  results 
show  that  under  this  assumption,  correlation  is  a  viable  technique  when  some  means 
of  generating  good  templates,  possibly  via  a  model  based  approach,  is  available. 

The  next  logical  step  in  this  process  was  to  assume  less  than  perfect  templates 
available  to  the  correlator.  All  180  images  were  inspected  to  determine  a  set  of 
templates  that  represent  the  target  at  various  scales  and  orientations.  With  this, 
14  templates  were  chosen:  four  Ml  tank,  three  F-16  airplane,  one  A-10  airplane, 
and  six  truck  templates.  The  14  chosen  templates  are  shown  in  Figure  21.  With 
the  limited  number  of  airplane  images  in  the  image  set,  all  airplane  results  will  be 
reported  together. 

Once  the  templates  were  chosen,  like  images  were  correlated  with  like  tem¬ 
plates.  For  example,  the  entire  set  of  84  Ml  tank  images  were  local  energy  normal¬ 
ized  and  correlated  by  the  four  Ml  tank  templates,  which  were  also  local  energy 
normalized.  The  airplane  and  truck  images  were  also  correlated  with  the  airplane 
and  truck  templates,  respectively.  This  technique  is  analogous  to  a  situation  where 
the  correlator  is  searching  for  a  target  of  known  type  in  an  image,  with  the  best 
resulting  correlation  peak  denoting  its  location  and  orientation.  Once  the  images 
were  thus  processed,  each  of  the  correlation  planes  for  each  image  were  inspected 
for  the  location  of  the  correlation  peak.  The  highest  of  these  correlation  peaks  and 
its  location  were  noted.  The  success/failure  was  based  on  correlation  peak  location 
and  the  template  from  which  it  resulted.  If  the  location  of  the  correlation  peak  was 
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Figure  19.  Original  Image 


Figure  20. 


Local  Energy  Normalization  2-D  Correlation  Plane.  Notice  the  high- 
intensity  pixel  at  the  center  of  the  tank  as  expected. 
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Figure  21.  Correlation  Template  Set. 


located  a  distance  greater  than  half  the  diagonal  of  the  box  encircling  the  target, 
the  correlation  was  deemed  a  failure  due  to  the  distance  criteria.  Similarly,  if  the 
correlation  peak  resulted  from  a  target  at  a  substantially  different  orientation  or 
scale  than  what  is  in  the  template,  the  correlation  was  deemed  a  failure  due  to  the 
target  identification  criteria.  Figures  22  and  23  represent  a  successful  correlation 
of  a  template  with  another  image.  Figures  24  and  25  represent  a  correlation  that 
failed  both  the  distance  and  target  identification  criteria. 

The  results  from  this  were  encouraging.  Of  the  84  Ml  tank  images,  12  failed  due 
to  the  distance  criteria,  with  an  additional  five  fiulures  due  to  the  target  identification 
criteria.  This  results  in  67  out  of  84  images  correctly  located,  for  a  success  rate  of  80 


Figure  22.  Original  Image 


Figure  23.  Local  Energy  Normalization  2-D  Correlation  Plane.  Notice  the  high- 
intensity  pixel  at  the  center  of  the  tank  as  expected. 
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Figure  24.  Original  Image 


Figure  25.  Local  Energy  Normalization  2-D  Correlation  Plane.  The  upper-most 
cross-hairs  indicate  the  true  location  of  the  target,  while  the  lower-most 
cross-hairs  indicate  the  location  found  by  correlation. 
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percent.  Of  the  18  airplane  images,  two  failed  the  distance  criteria  with  an  additional 
two  failing  the  target  identification  criteria,  resulting  in  14  out  of  18  images  correctly 
located  and  a  success  rate  of  78  percent.  Finally,  of  the  78  truck  images,  five  failed 
the  distance  criteria,  with  an  additional  11  failing  the  target  identification  criteria, 
resulting  in  62  out  of  78  successfully  located  and  a  79  percent  success  rate.  The 
overall  rate  is  143  out  of  180  for  a  success  rate  of  79  percent. 

The  final  step  in  the  local  normalization  process  was  to  correlate  all  180  images 
with  all  14  templates  and  note  the  highest  correlation  peak  of  the  resulting  14  cor¬ 
relation  peaks  found  for  each  image.  This  peak,  its  location,  and  the  template  from 
which  it  came  were  used  for  the  success/failure  determination.  Using  this  technique, 
the  results  dropped  significantly.  Out  of  the  84  Ml  tank  images,  65  failed  the  target 
identification  criteria  with  an  additional  two  failing  the  distance  criteria,  for  a  suc¬ 
cess  rate  of  17  out  of  84  (20  percent).  For  the  airplane  images,  10  failed  the  target 
identification  criteria  and  two  failed  the  distance  criteria  for  a  success  rate  of  six  out 
of  18  (33  percent).  Out  of  the  78  truck  images,  63  failed  the  target  identification 
criteria  with  one  failing  the  distance  criteria  for  a  success  rate  of  15  out  of  78  (19 
percent).  The  overall  success  rate  of  this  technique  is  38  out  of  180  (21  percent).  An 
example  of  a  correlation  failing  both  the  distance  criteria  and  target  identification 
criteria  is  found  in  Figures  26  and  27. 

Inspection  of  these  results  showed  an  interesting  trend.  A  total  of  141  images 
found  the  best  correlation  peak  when  matched  with  a  template  made  from  an  A- 10 
airplane.  However,  there  are  only  four  images  in  the  entire  image  set  that  contain 
an  A- 10.  Clearly,  this  template  was  skewing  the  results.  Perhaps  removing  this 
template  would  leave  a  better  set  of  templates  and  show  an  increased  success  rate. 

The  previous  correlation  was  again  run  with  the  exception  that  the  A-10  tem¬ 
plate  was  removed  from  the  template  set.  The  results  were  no  more  encouraging. 
Another  template,  one  with  an  Ml  tank  at  a  smaller  scale,  was  identified  as  the  best 
match  98  out  of  180  times.  Removing  this  template  also  resulted  in  another  Ml 
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Figure  26.  Original  Image 


Figure  27.  Local  Energy  Normalization  2-D  Correlation  Plane.  The  upper-most 
cross-hairs  indicate  the  true  location  of  the  target,  while  the  lower-most 
cross-hiurs  indicate  the  location  found  by  correlation.  This  correlation 
also  failed  to  identify  the  target  correctly. 
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tank  template  dominating.  Continually  removing  dominant  templates  as  they  ap¬ 
pear  soon  lead  to  a  case  where  all  Ml  templates  had  been  removed,  ensuring  that  the 
84  Ml  images  would  fail  the  target  identification  criteria.  Thus,  removing  dominant 
templates  did  not  lead  to  a  better  set  of  templates. 

Now  that  the  energy  normalized  and  local  energy  normalized  images  have  been 
investigated,  correlation  of  the  raw  images  was  examined  to  determine  if  the  success 
rate  increases. 

^.^.2  Raw  Images  Each  raw  image  was  correlated  with  a  template  made 
from  the  cutout  target  found  in  it.  The  target  in  the  image  was  located  and  a  box 
encircling  the  target  noted.  This  box  containing  the  target  was  extracted  from  the 
image  and  placed  in  the  center  of  a  128  x  128  template,  with  pixel  values  of  zero 
filled  around  it. 

Of  the  84  Ml  tank  images,  three  found  the  known  target  center  as  the  correla¬ 
tion  peak  (four  percent).  Of  the  18  airplane  on  runway  images,  all  failed  to  find  the 
known  target  center  (0  percent).  Finally,  out  of  the  78  truck  images,  30  were  able 
to  find  the  known  target  center  (38  percent).  This  resulted  in  an  overall  18  percent 
success  rate  for  finding  targets  in  raw  images.  Again,  for  the  first  cut,  this  implied 
that  conventional  correlation  is  not  a  viable  alternative  for  this  problem. 

However,  the  same  region  of  low  pixel  values  was  again  noted  as  it  was  in  the 
correlation  of  the  energy  normalized  images.  Reference  Figure  28  and  Figure  29. 
Again,  the  images  were  correlated  and  the  technique  of  finding  the  location  of  the 
minimum  correlation  value  was  used. 

Using  the  correlation  minimum  to  find  the  location  of  the  target,  24  of  the 
84  Ml  tank  images  were  able  to  find  the  known  target  center  (29  percent).  Of  the 
18  airplane  on  runway  images,  all  again  feuled  to  find  the  known  target  center  (0 
percent).  Of  the  78  truck  images,  28  found  the  center  (36  percent).  This  resulted 
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Figure  28.  Original  Image 


Figure  29.  Local  minimum  Correlation  Plane.  Notice  the  large  group  of  black  pixels 
at  and  around  the  center  of  the  known  position  of  the  tank. 
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in  an  overall  29  percent  success  rate  for  finding  targets  in  the  raw  images  when 
searching  for  the  minimum  correlation  value. 

Comparing  the  successes  of  the  two  techniques,  there  was  an  overlap  in  success¬ 
ful  tank  correlations  of  two  images,  and  an  overlap  of  successful  truck  correlations 
of  eight  images.  That  is,  in  10  images,  both  the  correlation  peak  and  correlation 
minimum  were  able  to  locate  the  target.  Combining  the  success  rate  of  both  tech¬ 
niques  and  taking  into  account  this  overlap,  25  of  84  Ml  tank  images  (30  percent), 
0  of  18  airplane  images  (0  percent),  and  50  of  78  truck  images  (64  percent)  have 
been  successfully  located,  for  an  overall  success  rate  of  42  percent.  Comparing  these 
results  with  those  found  with  energy  normalization  show  that  the  same  images  were 
successful  for  both  the  correlation  peak  and  correlation  minimum  techniques.  As 
such,  other  rules  needed  to  be  developed  to  increase  the  utility  of  correlation  as  a 
target  location  and  identification  system. 

Inspection  of  the  remaining,  unsuccessful  2-D  and  3-D  correlation  planes  lead 
to  possible  heuristic  rules  or  techniques  to  find  and  identify  targets.  These  techniques 
follow. 

The  first  heuristic  technique  involved  examining  the  3-D  correlation  plane  for 
the  existence  of  a  secondary  peak.  A  secondary  peak  is  a  large,  well  defined  peak, 
whose  maximum  correlation  value  is  not  the  correlation  peak.  Using  the  secondary 
peak  as  a  rule  for  target  location  is  already  documented  by  'liroxel  (20)  in  his  re¬ 
search.  Reference  Figure  30  and  Figure  31.  Searching  the  3-D  correlation  plane  for 
secondary  peaks,  additional  targets  were  located.  Specifically,  no  additional  tanks  (0 
percent),  10  of  18  additional  airplanes  (56  percent),  and  five  of  78  additional  trucks 
(six  percent)  were  located.  The  overall  success  rate  for  this  technique  was  eight  per¬ 
cent.  Combining  the  result  of  this  technique  with  the  previous  results  gave  success 
rates  of  25  of  84  tank  images  (30  percent),  10  of  18  airplane  images  (56  percent), 
and  55  of  78  truck  images  (71  percent)  for  an  overall  success  rate  of  50  percent. 
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Figure  30.  Original  Image 


Figure  31.  Secondary  Peak.  The  peak  on  the  right  side  of  this  graph  coincides  with 
the  known  center  of  the  airplane. 
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Figure  33.  Heuristic  Large  Local  Minimum  Region.  Notice  the  dark  region  of  low 
pixel  values  at  the  known  location  of  the  tank. 
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The  second  heuristic  technique  involved  searching  the  2-D  correlation  plane  for 
a  lone  region  approximately  the  known  size  of  the  target  to  see  if  a  region  of  small 
pixel  values  is  present.  Reference  Figure  32  and  Figure  33,  there  is  a  dark  region  in 
the  area  of  the  tank  position.  Using  this  rule,  16  of  84  tank  images  (19  percent), 
none  of  the  airplane  images  (0  percent),  and  14  of  78  truck  images  (18  percent)  were 
located  for  a  success  rate  for  this  technique  of  17  percent.  The  individual  success 
rates  climbed  to  41  of  84  tank  images  (49  percent),  10  of  18  airplane  images  (56 
percent),  and  69  of  78  truck  images  located  (88  percent).  The  overall  success  rate 
has  now  climbed  to  120  of  180  targets  located  (67  percent). 

Another  heuristic  technique  searched  the  2-D  correlation  plane  for  a  small 
region  of  low  pixel  values  hidden  among  many  such  small  regions  in  the  image. 
Referencing  Figure  34  and  Figure  35,  there  is  a  dark  region  in  the  area  of  the  tank 
position,  however,  it  is  not  the  only  dark  region  present.  Using  this  rule,  19  of  84 
tank  images  (23  percent),  no  airplane  images  (0  percent),  and  no  truck  images  (0 
percent)  were  located.  This  technique’s  success  rate  is  11  percent.  Overall,  60  of  84 
tank  images  (71  percent),  10  of  18  airplane  images  (56  percent),  and  69  of  78  truck 
images  (88  percent)  have  been  located.  The  overall  success  rate  has  now  risen  to  77 
percent. 

The  final  heuristic  technique  searches  the  2-D  correlation  plane  for  a  small 
pinpoint  of  high  pixel  values  among  a  dark  region.  Reference  Figure  36  and  Figure  37, 
we  can  see  that  such  a  pinpoint  does  exist.  Using  this  rule,  23  of  84  tank  images  (27 
percent),  three  of  18  airplane  images  (17  percent),  and  2  of  78  truck  images  (three 
percent)  were  located. 

Combining  the  results  from  all  techniques  used  to  correlate  the  raw  images 
resulted  in  83  of  84  tank  images  located  (99  percent),  13  of  18  airplane  images 
located  (72  percent),  and  71  of  78  truck  images  located  (91  percent).  This  lead  to 
an  overall  success  rate  of  167  out  of  180  (93  percent).  Using  all  of  the  heuristic 
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Figure  35.  Heuristic  Small  Local  Minimum  Region.  Notice  the  region  enclosed  in 
the  b(»c  at  the  location  of  the  tank. 
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Figure  36.  Original  Image 


Figure  37.  Heuristic  Small  Pinpoint  Region 
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rules  along  with  the  correlation  peak  and  correlation  minimum,  a  rule-based  expert 
system  can  be  developed  using  correlation. 

The  overall  flowchart  showing  all  of  the  different  correlation  techniques  is  shown 
in  Figure  38. 
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Figure  38.  Correlation  Flowchart 
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4-5  Summary 


This  chapter  presented  the  results  achieved  by  the  various  correlation  routines 
and  rules.  Local  energy  normalization  was  used  to  pre-process  the  set  of  images 
before  correlation.  When  a  perfect  template  was  assumed,  the  correlation  success 
rate  was  lOOconclusions  reached  as  a  result  of  this  research  effort. 
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V.  Conclusions 


This  chapter  briefly  summarizes  the  results  reported  in  Chapter  IV. 

5.1  Research  Summary 

The  goal  of  this  effort  was  to  determine  the  viability  of  correlation  as  an  au¬ 
tonomous  target  recognizer  in  visible  light  terrain  board  imagery.  The  images  used 
were  a  representation  of  low  light  level  visible  imagery.  Both  raw,  energy  normal¬ 
ized,  and  locally  energy  normalized  images  were  correlated  using  KHOROS  routines 
and  the  correlation  peak  and  correlation  minimum  were  noted  and  compared  to 
the  known  target  location.  Heuristic  rules  were  also  developed  to  facilitate  target 
location. 

The  first  part  of  this  effort  involves  pre-processing  the  images  via  energy  nor¬ 
malization.  Each  image,  and  the  template  made  from  that  image,  were  energy 
normalized  before  correlation.  Once  the  image  was  correlated  with  its  template,  the 
location  of  the  correlation  peak  and  correlation  minimum  were  compared  against  the 
known  target  centers  and  the  success  rate  of  each  was  determined.  When  using  the 
correlation  peak,  3  Of  84  tank  images  (four  percent),  0  of  18  of  the  airplanes  images 
(0  percent),  and  30  of  78  truck  images  (38  percent)  were  located,  for  a  success  rate 
of  33  out  180  (18  percent).  When  the  correlation  minimum  was  used,  the  success 
rate  climbed.  Of  the  84  tank  images,  24  targets  were  located  (29  percent),  0  of  the 
18  airplane  images  were  located  (0  percent),  and  28  of  78  truck  images  (36  percent) 
for  a  success  rate  of  52  out  of  180  (29  percent). 

Combining  the  energy  normalization  success  rates  for  both  the  correlation  peak 
and  correlation  minimum,  and  accounting  for  overlap  between  the  two  techniques  of 
two  tank  images  and  eight  truck  images,  gives  success  rates  of  25  of  84  tank  images 
(30  percent),  0  of  the  18  airplane  images  (0  percent),  and  50  of  78  truck  images  (64 
percent)  for  an  overall  success  rate  of  75  of  180  (42  percent). 
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Another  pre-processing  technique,  called  local  energy  normalization  was  used 
to  test  the  effectiveness  of  the  correlator.  A  sequence  of  three  experiments  were 
run  with  increasing  level  of  difficulty.  The  first  experiment  involved  extracting  a 
template  from  an  image  and  then  local  energy  normalizing  both  the  template  and 
the  image  and  correlating.  This  is  analogous  to  assuming  that  the  correlator  has  a  set 
of  perfect  images  available  for  target  location  and  identification  as  well  as  proving 
the  integrity  of  the  correlator.  As  expected,  all  180  images  found  the  correlation 
peak  at  the  known  center  of  the  target,  with  the  correlation  peak  equal  to  unity. 

The  second  experiment  involved  identifying  a  set  of  templates  that  represent 
the  image  set  in  terms  of  scale  and  orientation  of  all  the  possible  targets.  Once 
identified.  Ml  tank  images  were  correlated  with  Ml  tank  templates,  airplane  images 
were  correlated  with  airplane  templates,  and  truck  images  were  correlated  with  truck 
templates.  The  largest  correlation  peak  found  for  each  image  was  scored  against  two 
criteria,  the  distance  criteria  and  the  target  identification  criteria.  Failure  of  one 
or  both  criteria  resulted  in  a  failed  correlation.  The  results  were  encouraging;  67 
of  84  Ml  tank  images,  14  of  18  airplane  images,  and  62  of  78  truck  images  were 
successfully  located  for  an  overall  success  rate  of  143  of  180  (79  percent). 

The  final  experiment  correlated  all  180  images  with  all  14  templates.  The 
largest  of  the  14  correlation  peaks  found  for  each  image  was  again  checked  against 
the  two  criteria  and  the  success  or  failure  determined.  Results  were  significantly 
lower.  Out  of  the  84  Ml  tank  images,  17  were  located  for  a  success  rate  of  20 
percent.  A  total  of  six  of  the  18  airplane  images  were  located,  for  a  success  rate  of 
33  percent.  Of  the  78  truck  images,  a  total  of  15  were  located,  for  a  success  rate  of 
21  percent.  The  overall  success  rate  of  this  technique  was  21  percent. 

The  second  part  of  this  effort  involves  correlating  the  raw  images  and  noting  the 
magnitude  of  the  correlation  peak  and  correlation  minimum  as  well  as  their  location. 
Using  only  the  location  of  the  correlation  peak,  the  success  rate  was  extremely  low. 
Of  the  84  Ml  tank  images,  only  three  were  located,  for  a  success  rate  of  four  percent. 
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Of  the  18  airplane  images,  all  failed  to  locate  the  known  target.  Finally,  of  the  78 
truck  images,  only  18  were  located,  for  a  success  rate  of  23  percent.  The  overall 
success  rate  of  using  only  the  correlation  peak  was  an  abysmal  12  percent. 

Using  only  the  location  of  the  correlation  minimum  also  led  to  a  low  success 
rate.  Of  the  84  tank  images,  24  were  located  (29  percent).  Again,  none  of  the  air¬ 
planes  were  successfully  located.  Finally,  of  the  78  truck  images,  27  were  successfully 
located  (35  percent).  The  overall  success  rate  of  using  only  the  correlation  minimum 
led  to  an  overall  success  rate  of  28  percent. 

Also,  a  set  of  heuristic  rules  were  developed  to  enhance  the  effectiveness  of 
the  correlator.  The  success  rate  for  the  first  heuristic  rule,  using  the  presence  of 
a  secondary  peak,  led  to  the  following  success  rates:  no  tanks  located  (0  percent), 
10  airplanes  out  of  18  located  (13  percent),  and  five  trucks  out  of  78  (six  percent) 
located.  The  overall  success  rate  for  using  the  location  of  the  secondary  peak  for 
target  location  was  eight  percent. 

The  second  heuristic  rule  developed  was  to  search  the  2-D  correlation  plane  for 
a  lone  region  of  low  pixel  values  approximately  the  size  of  the  known  image.  Using 
this  rule,  the  success  rates  were:  16  of  84  tanks  located  (19  percent),  none  of  the  18 
airplane  images,  and  27  of  78  truck  images  (35  percent).  The  overall  success  rate  for 
this  technique  was  24  percent. 

Another  heuristic  rule  developed  was  to  agmn  search  the  2-D  correlation  plane 
for  a  region  of  small  pixel  values.  Using  this  rule,  the  success  rates  were:  19  of  84 
tank  images  (23  percent),  no  additional  truck  images  or  airplane  images  located. 
The  overall  success  rate  for  this  rule  was  11  percent. 

The  final  heuristic  rule  involved  searching  the  2-D  correlation  plane  for  a  pin¬ 
point  of  relatively  high  pixel  values  among  a  region  of  relatively  low  pixel  values. 
Using  this  rule,  the  success  rates  were:  23  of  84  tank  images  (27  percent),  three  of 
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18  airplane  images  (17  percent),  and  2  of  78  truck  images  (three  percent)  located. 
The  overall  success  rate  for  this  rule  was  16  percent. 

Combining  the  results  from  all  of  the  heuristic  rules  with  the  results  from  the 
correlation  peak  and  correlation  minimum  lead  to  individual  success  rates  of:  83  of 
84  tank  images  (99  percent),  13  of  18  airplane  images  (72  percent),  and  71  of  78  truck 
images  (91  percent)  located.  The  final,  overall  percent  rate  for  all  180  images  was  93 
percent  for  correlating  raw  images.  These  results  show  that  correlation  can  be  used 
as  the  basis  for  a  rule-based  expert  system  for  autonomous  target  recognition. 

Conclusion 

The  results  of  this  research  effort  show  that  correlation  is  indeed  a  viable  tech¬ 
nique  for  locating  targets  in  visible  light,  terrain  board  imagery.  Classical  correlation 
searches  the  correlation  plane  and  reports  the  maximum  value  as  the  location  of  the 
target.  However,  this  by  itself  was  not  shown  to  be  a  viable  target  locator.  Other 
rules  needed  to  be  developed.  Given  these  types  of  images,  the  rules  for  successfully 
locating  the  targets  were  not  known  a  priori,  however  they  were  easily  found.  When 
all  of  the  various  rules  are  taken  as  a  group,  correlation  is  indeed  a  viable  technique 
for  autonomous  target  location  and  recognition.  These  various  rules  form  the  basis 
of  a  rule-based  expert  system. 
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Appendix  A.  Loading  the  Imagery 


Both  sets  of  data  arrived  on  Sun  tapes.  To  unload  these  onto  Scgraph,  use  the 
tape  drive  located  in  Room  132  of  Building  640.  First,  create  a  subdirectory  using 
the  “md”  command.  Then  enter  this  subdirectory  by  using  the  “cd”  command. 
Once  in  the  appropriate  subdirectory,  load  the  tape  into  the  tape  drive.  Since  the 
data  on  the  tape  is  stored  in  the  “tar”  format,  unload  the  tape  by  typing  “tar  xvf 
/dev/rstO”,  which  tells  the  machine  that  tar  formatted  data  will  be  read  from  a 
scuzzy  tape  (rst)  from  device  0  (the  tape  drive). 

As  files  are  unloaded,  their  names  will  scroll  on  the  screen.  The  total  time  to 
unload  a  tape  is  around  15  or  20  minutes  depending  on  how  much  is  stored  on  it. 
Once  the  tape  is  finished  unloading,  the  tape  drive  will  automatically  rewind  to  the 
beginning.  Remove  the  tape  from  the  tape  drive  and  keep  handy  for  emergencies. 

The  data  was  unloaded  into  various  sub-subdirectories,  which  should  be  left  as 
is.  ANVIL  expects  files  to  be  in  certain  subdirectories,  and  to  change  file  locations 
is  to  tempt  fate. 


51 


Appendix  B.  HIPS  Format 


The  HIPS  format  is  new  to  AFIT.  It  consists  of  a  header  followed  by  the 
actual  binary  gray  scale  data.  A  description  of  the  header  information,  taken  from 
the  ANVIL  Software  User’s  Manual,  follows. 

The  HIPS  image  format  description; 

blank  string 
blank  string 

number  of  frames (always  1) 

blank  string 

rows  (128  or  512) 

cols  (128  or  512) 

bits  per  pixel  (always  8) 

bit  packing  (always  0) 

pixel  format  (always  0  and  the  image  must  be  bytes) 
blank  string 
blank  string 
. (always  a  period) 
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Appendix  C.  Image  Conversions 


Given  the  C  code  to  convert  binary  to  floating  point  gray  scale  values,  it  was  a 
simple  matter  to  put  in  a  section  at  the  beginning  to  read  in,  and  discard,  information 
up  to  the  period  that  delineates  the  header.  Once  this  period  is  read  in,  the  program 
continues  on  as  originally  written  to  perform  the  conversion.  This  program,  titled 
HIPS2float.c,  is  listed  in  Appendix  D.l.  Another  useful  tool  is  a  C  Shell  to  do  a  batch 
conversion  of  all  files  with  the  .hips  extension.  This  C  Shell,  titled  doHIPS2asc  and 
found  in  Appendix  D.2,  will  read  in  each  file  with  the  .hips  extension  and  convert  to 
a  floating  point  ascii  value  representation.  It  should  be  noted  here  that  the  various 
programs  and  routines  may  call  this  representation  either  ascii  or  floating  point. 
These  can  be  considered  completely  interdiangeable  for  our  purposes. 

Once  the  files  are  in  the  ascii  format,  they  can  be  converted  to  any  number  of 
file  formats,  including  gray  scale,  rle,  and  eps  formats.  The  routines  to  do  this  are 
resident  on  the  Silicon  Graphics  machines  and  are  easily  accessible.  These  programs 
have  been  linked  into  one  C  Shell  program  so  that  the  file  can  be  left  in  whatever 
format  necessary  by  simply  commenting  out  the  appropriate  lines.  This  C  Shell, 
called  mwenzel,  is  listed  in  Appendix  D.3.  Included  in  this  file  is  the  get4d  routine 
which  will  display  the  image  onto  the  screen  so  that  you  can  see  it.  The  following  is 
a  description  of  the  functions  and  programs  called  by  the  mwenzel  C  Shell. 

•  HIPS2float  infile  outfile.asc  -  this  strips  off  the  header  information  from  the 
infile  and  converts  the  HIPS  formatted  image  to  ascii  format. 

•  float^ay  infile.asc  outfile.gra  -  this  converts  ascii  files  to  gray  value  files 

•  graytorle  -o  outfile.rle  128  128  infile.gra  -  takes  the  gray  file,  of  dimension  128 
X  128  pixels,  and  converts  to  rle  format 

•  rleflip  -V  infile.rle  outfile.rle  -  This  will  flip  upside  down  rle  images  to  right  side 
up  images.  This  step  is  unnecessary  if  the  image  is  already  correctly  displayed 
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•  get4d  infile.rle  -  this  will  display  the  rle  file  to  the  screen  for  us  to  see 

•  crop  0  32  127  95  infile.rle  >  outfile.rle  -  crops  the  original  from  128  x  128  to 
128  by  64  by  cutting  oflF  the  top  and  bottom  quarter.  The  cropped  file  will 
have  a  total  of  8192  bytes.  The  parameters  in  the  routine  call  are  min-x  min-y 
max-x  and  max-y  respectively. 

•  rleprint  infile.rle  >  outfile.asc  -  converts  the  rle  file  to  ascii 

•  float^a  infile.asc  outfile.gra  -  puts  the  file  in  final  form  for  processing. 

•  rle2eps  $maketemp.rle  2.5  -  converts  an  rle  file  to  eps  for  output  in  documents 

C.l  Upside-doxvn  Image  Conversion 

Should  any  displayed  image  be  upside  down,  it  can  be  put  right-  side  up  by 
using  the  rleflip  program.  Notice  that  the  input  to  this  program  is  the  rle  version  of 
the  file  and  the  output  is  also  a  rle  file.  This  routine  is  part  of  the  rawenzel  C  Shell 
and  can  be  invoked  by  removing  the  #  in  front  of  it. 

C.2  KHOROS  Conversion 

HIPS  formatted  files,  once  converted  to  their  ascii  representations  by  the 
HIPS2float  program,  are  ready  to  be  processed  by  KHOROS.  If  the  image  is  up¬ 
side  down,  KHOROS  will  still  process  it  as  always,  however  it  is  tougher  on  the 
human  operator.  This  can  be  rectified  by  the  rleflip  program. 


54 


Appendix  D.  C  Programs  and  C  Shells 

D.  1  HIPS2float.  c 
/♦  HIPS2f loat . c  ♦/ 

tinclude  <8tdio.h> 
iinclude  <math.h> 
idefine  skip.line  gets(junk) 

•define  loopi(stopi)  for(i»0;i<(stopi) 

•define  loopjCstopj)  for(j=0; j<(8topj) ; j++) 

•define  loopabj (start j ,  stopj)  for(j*(startj) ; j<»(8topj) ; j++) 
•define  loopabiCstarti.  stopi)  for(i«(8tarti):i<-(8topi) :i‘M-) 
•define  loopkCstopk)  for(ksO,*k<(8topk):k'M-) 

•define  loopij(A,B)  for(i»0;i<(A) U**)  for(j«0; j<(B) ; j++) 
•define  fskip.lineCA)  fgetsCjunk,  256,  A) 

mainCargc.argv) 
int  argc; 
char  ’•■argv  □ ; 

{ 

FILE  ffout,  *fin:  int  i,  j,  k,  count«0; 
unsigned  char  outval; 
float  inval,  max,  min; 
char  junk[256]; 

if (argc  !■  3){ 

printfC'i!!  The  command  line  should  be  !!!:\n\n 
Terrain. ASC  infile  outfile\n\n''); 
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ezit(O) ; 

} 

Set  Up  FlleS 

if  ((fin=fopen(argv[l]  ,"r''))  “  KULL)-( 
printfC'I  can’t  open  the  input  file");  exit(-l);  } 

if  ((fout*fopen(argv[2] ,"w"))  »*  NULL){ 
printfC'I  can’t  open  the  output  file");  exit(-l);  } 

/**  skip  header  information  **/ 

vhileC  fgetc(fin)  !»  • .*  ) 

{ 

fskip.lineCfin) ; 

printfC  count  «  5ld\n",  count ♦♦); 

} 

fskip.lineCfin) ; 

Skip  last  carriage  return  after  "."  *♦**/ 

/*  get,  convert  and  write  data  */ 
whileC  (inval  «  (float)  fgetc(fin))  !*  EOF) 
f printf (font , "Xf \n" , inval) ; 

fclose(fin) ; 

fclose(fout) ; 

printf ("\n  I’m  Done!\n"); 
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} 

End  of  Main  ♦♦*♦**♦**♦♦♦♦*♦♦♦♦♦♦/ 


D.2  doHIPS2asc 

t!/bin/c8h  foreach  fileC^.hips) 
set  temp  *  $file:r 
HIPS2float  $f lie. hips  $file.asc 
end 

echo  "I’m  all  done  converting  .hips  files  to  .asc  files" 
D.3  rawemel 


f  >.  /bin/csh 

if  ($1  «•  ""  II  $2  ««  ""  II  $3  =»  ""  1 1  $4  »»  "")  then 
echo  " 

echo  "The  proper  format  is:" 
echo  "  " 

echo  "ravenzel  filename  rootname  xsize  ysize 
echo  "  " ; 

exit  -1 
endif 

set  maketemp  *  $2:r 

HIPS2float  $1  Smaketemp . asc 

float .gray  $maketemp . asc  $maketemp.gra 

graytorle  -o  $maketemp.rle  $3  $4  Imaketemp.gra 

i  rleflip  -V  Smaketemp.rle  >  fimaketemp.rle 
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get4d  f$maketemp.rle 

i  crop  0  32  127  95  f$maketemp.rle  >  $maketemp . rle 
f  get4d  Imaketenp . rle 
i  rm  $maketemp . gra 
i  m  Imaketemp . asc 

i  rleprint  Snaketemp.rle  >  Smaketemp.asc 
i  float.gray  $maketenp . aec  $maket emp.gr a 

#  rm  Smaketemp . asc 

i  rle2ep8  $maketemp . rle  2.5 
echo  "I’m  all  done" 

D.4  movefile 

•!/bin/csh 
foreach  file(*.gra) 
set  temp  ■  $file:r 
mv  $f lie. gra  Sfile.img 

#  cp  $f lie. gra  Ifile.img 
end 

echo  "I’m  all  done  converting  .gra  files  to  .img  files" 

D.5  docorrelate.c 

/*  docorrelate . c 

*  calls  C  Shell  correlate 

*  reads  in  data  from  autocor.dat 
*/ 

tinclude  <stdio.h> 
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mainO  { 

int  i.x.y.w.Ji.xc.yc.xo.yo; 

FILE  ♦ipf; 

char  cmd [140] , ecmd [140] , image [30] ; 
ipf  *  fopen("autocor.dat'',''r") ; 
for  (i=0;  i<180;  ++i){ 

fscanfCipf ,''Xs5id*/,d*/.dJid%d%d)id%d",  image,  kx.  ky,  kv,  kh,  kxc, 

kyc, 

kyo,  kxo); 

sprintfCcmd,  "correlate  Xd  %d  %d  Xd  Xd  Xd  Xd  Xd",  image, 

X , y , w , h , xc , yc , xo , yo) ;  pr intf ( "XdXsNn" , i , cmd) ; 

sprintf (ecmd,  "ecorrelate  eXs  Xd  Xd  Xd  Xd  Xd  Xd  Xd  Xd",  image, 
x,y,w,h,xc,yc,xo,yo); 

systemCcmd) ;  system(ecmd) ; 

} 

} 

D.6  medfilter.c 

tinclude  <math.h> 
f include  <8tdio.h> 

idefine  EXTENT  9  /*  size  of  image  i.e  128  x  128  */ 
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idefine  WINDOW  5  /*  size  of  window,  i.e  5  x  5  */ 

void  main(argc,argv) 
int  argc; 
char  ♦argv  [] ; 

{ 

FILE  *ofp,*ifp; 

float  val [EXTENT] [EXTENT] , outval [EXTENT] [EXTENT] ; 
float  sortval [WINDOW’i’WINDOW]  .temp, median; 
int  i, j , k,l,m,n.q,r, offset, newdim, count; 

if  ((ifp=fopen(argv[l] ."r"))  ==  NULL){ 

/*  if  ((ifp*fopen("","r''))  ==  NULL){*/ 

printfC'I  can’t  open  the  input  file''); 
exit(-l); 

} 

if  ((ofp=fopen(argv[2] ,"w"))  ==  NULL){ 

/*  if  ((ofp«fopen("","w"))  ==  NULL){*/ 

printfC'I  can’t  open  the  output  file"); 
exit(-l); 

} 

for  (i=0;  KEXTENT;  ++i){ 
for  (j=0;  jcEXTENT;  +*j){ 
val[i]  [j]=0.0; 
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outvalCi][j]  =  O.O; 


} 

} 

for  (i»0;  i<EXTENT;  +-*-i){ 
for(j=0;  j<EXTENT;  ++j)< 

(f scanf  (ifp,  "‘/f ''  ,*val Ci]  Cj] )) ; 

} 

} 

offset  =  (WIMD0W-l)/2; 
newdim  =  EXTENT  -  WINDOW  +  1; 

h  printf ("offset  is  %d  and  nevdim  is  Xd\n", off set, newdim) ;♦/ 

count  *  0; 

for(i=offset;  icCoffset-t-nevdim) ;  ++i){ 

for(j=offset;  j<(offset-»-newdim) ;  ++j){ 

for(l=(i-offset) ;  l<(i+offset+l) ;  ++!){ 
for(m=(j-offset) ;  m<(j+offset+l) ;  ++m){ 

/*  printf ("i  is  %d  is  Xd,l  is  Xd,  m  is 
Xd\n",i,j,l.m);  >/ 

/*  read  values  into  sortval  for  sorting*/ 

sortval [count]  -  val [1] [m] ; 

/*  printf ("sortval [Xd]  is  Xf\n", count, val [1] [m]);  ♦/ 
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count  =  count  +  1; 
}  /*  close  m  */ 


}  /♦  close  1  */ 
count  *  0; 

/*  begin  bubblesort  on  sortval [WIND0W*VIND0W]  */ 

for(q=0;  q<((WIMD0W*MIMD0W)-l) ;  •►+q){ 
for(r=((WIND0W*HIin)0W)-l);  r>q;  “r){ 
if  (sortval[r-l]  <  sortval[r]){ 
temp  =  sortval[r-l] ; 
sortval [r-1]  =  sortval [r] ; 
sortvalCr]  =  temp; 

}/*  close  if  */ 

}/*  close  r  ♦/ 

}/*  close  q  ♦/ 

h  end  bubblesort  */ 

/♦  for  (k=0;  k<(WIHD0W*WIHD0W) ;  ++k){ 

printf ( "Xf \n" , sortval [k] ) ; 

} 

printf  ("\n'');*/ 

median  «  sortval [( (WINDOU^WINDOV) -1 ) /2] ; 

/*  printf ("med  is  Xf \n\n%''. median) ;*/ 

outval[i][j]  =  median; 

}  /*  close  j  ♦/ 

}/♦  close  i  ♦/ 
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for  (i=0;  i<EXTENT;  ++i){ 


for  (j=0;  j<EXTENT;  ++j){ 

f  printf  (of p ,  "Xf  \n'' ,  outval  [i]  [ j  ] ) ; 

} 


}  /♦  end  of  main  */ 

D.  7  domedfUter 

#!/bin/csh 
foreach  f ileC^.asc) 
set  temp  »  $file:r 
set  outfile  »  $temp.med 

echo  $outfile 

medf liter  $file  $outfile 

end 

echo  "I’m  all  done  median  filtering  .asc  files" 
D.8  enorm 

/*  This  will  energy  normalize  images  */ 

iinclude  <8tdio.h> 
finclude  <math.h> 

nainCint  argc,char  **argv) 


63 


int  i.j; 

FILE  ♦ofp,*ifp; 
float  val[128][128]; 
double  eval[128] [128] .sum; 

ifCargc  !=  3)  { 
printf  (''\n%s%sXs\n\n" , 

“usage:  ",  argv[0],  "  infile, 
exit(l); 

} 

for  (i*0;  i<128;  ++!)•[ 
for  (j=0;  j<128;  ++j){ 
valCi]  [j]»0.0; 
evalCi]  [j]=0.0; 

} 

} 

sumsO . 0 ; 

ifp  «  fopen(argv[l] ,"r") ; 
ofp  s  fopen(argv[2] ,"w"); 

for  (i*0;  i<128;  ++!){ 
for(j=0;  j<128;  ++j){ 

(fscanf (ifp, “Xf", Aval [i] [j])) 

} 

} 


outfile") ; 
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for  (i*0;  i<128;  ++!){ 

for  (j=0;  j<128;  ++j){ 

sum  s  sum  (double)  (val  [i]  [j]*val[i]  [j]) ; 

} 

} 

sum  -  sqrt(sum): 

for  (i=0;  i<128;  ++i){ 

for  (j=0;  j<128;  ++j)-( 

eval[i][j]  =  valCi]  [j]/sum; 

} 

} 

for  (i*0;  i<128;  ++!){ 
for  (j«0;  j<128;  ++j){ 

f printf  (of p ,  ''%lf  \n” , eval [i]  [j] ) ; 

} 

} 


} 

D.9  doenorm 

#!/bin/csh 
«  cd  ../Ml IMAGES 
foreach  file(>t>.asc) 
set  temp  «  $file 
erorm  $file  e$file 
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end 

echo  "I’m  all  done  enorming  .asc  files" 

D.IO  correlate 
*!/bin/csh 

set  infile  s  $i 
set  xs$2 
set  ys$3 
set  vs$4 
set  hs$5 
set  xc=$6 
set  yc=$7 
set  xos$8 
set  yo«$9 

set  roots$l:r 

set  outfile=$root .dtS 


asc2viff  -i  ./KTRUTH/$inf ile  -o  /tmp/rawlO  -fi  0  -t  0  -r  128 
-c  128  -m  1  -n  1  -b  1  -d  0  -h  0  -8  0 

vextract  -i  /tmp/ravlO  -o  /tmp/rav20  -x  $x  -y  $y  -w  $v  -h  $h 
vfft  -il  /tmp/rawlO  -ol  /tmp/rav30  -d  0 

vpad  -i  /tmp/raw20  -o  /tmp/rav40  -r  128  -c  128  -d  $yo  -e  $xo 
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-j  0  -k  0 


vfft  -il  /tmp/rav40  -ol  /tmp/rav50  -d  0 

vconj  -i  /tmp/rav50  -o  /tmp/rav60 

vmul  -il  /tmp/ravSO  -i2  /tmp/raw60  -o  /tmp/ravTO 

vfft  -il  /tmp/raw70  -ol  /tmp/rav80  -d  1 

vtranslat  -i  /ttap/rawSO  -o  /tmp/rav90  -x  64  -y  64  -w  1 

vctor  -i  /tmp/raw90  -o  /tmp/ravlOO  -t  3 

vstats  -i  /tmp/ravlOO  -f  ./KTRUTH/$outfile  -all  1  -mu  0  -var  0  -sd  0 
-rms  0  -vmax  0  -xmax  0  -ymax  0  -vmin  0  -xmin  0  -ymin  0  -in  0  -pin  0 
-nin  0  -pts  0  -ppts  0  -npts  0  -sk  0  -kur  0  -ent  0  -con  0  -b  1 

txprismS  -title  "$infile"  -imlp  iml32  -alpha  90.0  -theta  0.0  -i  11 
/tmp/ravlOO 


D.ll  checkjnoTm.  c 

I*  This  will  check  energy  normalize  images  */ 

iinclude  <stdio.h> 
tinclude  <math.h> 
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main(argc.argv) 
int  argc; 
char  ♦argv  [] ; 


int  i.j; 

FILE  ♦ofp,*ifp; 
float  val [128] [128] : 
double  eval[128] [128] .sum; 

if  (argc  !=  2)  •[ 
printf  ("\n%s5ls%8\n\n'' , 

’•usage:  ”,  argv[0],  ”  infile"); 
ezit(l); 

} 

sun^O.O; 

if  ((ifp  =  fopen(argv[l] ,"r”))  =®  HULL){ 
printfC  I  can’t  open  ifp  X8",argv[l] ) ; 
exit(-l) ; 

} 

for  (i»0;  i<128;  ++i){ 
for(j=0;  j<128;  ++j){ 
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(fscanf (ifp,"Xlf",teval[i] [j])); 
sum  =  sum  +  (eval[i]  [j]*eval[i]  [j]  ) ; 


} 

} 

pr intf ( "Xf \n" , sum) ; 

} 

D.12  make-template.  c 

/*  make.template . c  reads  in  truth  data  from  autocor.dat 
extracts  the  target  and  places  in  template  names 
*  "T"  followed  by  original  images  name. 

♦/ 

f include  <8tdio.h> 

mainO 

int  i,j,k,x,y,w,h,xc,yc,xo,yo,l.m; 

FILE  ♦ipfl,*ipf2,*opf ; 
char  image [60] , si [60] ; 
float  orig_file[128] [128] .val[128] [128] ; 

ipfl  *  f open ( "ant ocor.dat'',''r''); 

for  (k»0;  k<180;  ++k){ 
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fscaiif(ipfl/'%8%dr.d%d7.dXd*/ld%d%d".  image,  kx.  kj.  kJi,  kh. 
kxc,  kjc,  kxo,  kyo); 


/*  open  original  image  for  reading  */ 

if  ((ipf2  =  fopen(image,''r‘'))  HULL){ 

printfC  I  can’t  open  ipf2  XsXn". image); 
exit(-l) ; 

} 

/*  read  in  data  from  original  image  */ 

for  (i=0;  i<128;  ++i){ 
for  (j=0;  j<128;  ++j)-C 

f scanf  (ipf2,  '7.f "  ,korig_f  ile[i]  [j]  ) ; 

} 

} 

/*  backfill  template  array  with  all  0.0  values  */ 

for  (i=0;  i<128;  ++i){ 
for  (j=0;  j<128; 
val[i][j]  =0.0; 

} 

} 

/*  fill  in  template  array  with  target  pixels, 
and  center  (one  step)  */ 
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1  =  yo; 

for  (i«y;  i<y+h;  ++i){ 
n  =  xo; 

for  (j=x;  j<x+w;  ++j){ 

val[l]  [m]  =  orig_file[i]  [j]  ; 
m  =  m  +  1; 

} 

1=1+1; 

} 

/+  prepare  outfile  name  T  +  original  image  name 
and  store  template  array  +/ 

sl[0]  =  »T’; 
sl[l]  =  ’\0»; 

St r cat (si, image) ; 

printfC'JCd  7.s\n'',k,sl) ; 

if  ((opf  =  fopen(sl,"w''))  ==  HULL){ 
printf  ("  I  can’t  open  opf  \n"); 
exit(-l) ; 

} 

for  (i=0;  i<128;  ++i){ 
for  (j=0;  j<128;  ++j){ 
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f printf  (opf ,  "%f \n'' , val  [i]  [j] ) ; 


} 

} 

fclo8e(ipf2) ; 
fclose(opf) : 

} 

} 

D.13  new-correlate 
#!/bin/c8h 

8et  image  =  $1 

8et  template  =  $2 

8et  root  *  $l:r 

8et  outfile  »  e$root.tdat 

8et  eimage  -  e$image 

8et  etemplate  =  e$template 

techo  $image  $eimage 

iecho  $template  Setemplate 

techo  $root 

iecho  $outfile 

techo  $e image 

techo  $etemplate 

echo"" 

enorm  $ image  $e image 


72 


enorm  $template  $etemplate 


asc2viff  -i  . ./KTRUTH/$eimage  -o  /tmp/ravlO  -fi  0  -t  0 
-r  128  'C  128  -m  1  -n  1  -b  1  -d  0  -h  0  -s  0 


vfft  -il  /tmp/ravlO  -ol  /tmp/rav20  -d  0 


asc2viff  -i  . . /KTRUTH/letemplate  -o  /tmp/raw30  -fi  0  -t  0 
-r  128  -c  128  -m  1  -n  1  -b  1  -d  0  -h  0  -s  0 

vfft  -il  /tmp/rav30  -ol  /tfflp/rav40  -d  0 

vconj  -i  /tmp/rav40  -o  /tmp/rav50 


vmul  -il  /tmp/rav20  -i2  /tmp/raw50  -o  /tinp/raw60 
vfft  -il  /tmp/raw60  -ol  /tmp/raw70  -d  1 
vtranslat  -i  /tmp/raw70  -o  /tmp/rav80  -x  64  -y  64  -w  1 
vet or  -i  /tmp/raw80  -o  /tmp/rav90  -t  1 

vstats  -i  /tiiip/raw90  -f  .  ./KTRUTH/$outfile  -all  1  -mu  0  -var  0  -sd  0 
-rms  0  -vmax  0  -zmaz  0  -ymaz  0  -vmin  0  -xmin  0  -ymin  0  -in  0  -pin  0 
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-nin  0  -pts  0  -ppts  0  -npts  0  -sk  0  -kur  0  -ent  0  -con  0  -b  1 


echo  $e image 
echo  $etemplate 
rm  $e image 
rm  $etemplate 

D.I4  do.new-correlate.c 

/*  do.new. correlate . c 

>•'  calls  C  shell  nev.correlate 

*  reads  in  data  from  autocor.dat 

*1 

tinclude  <stdio.h> 

j 

I 

mainC) 

{ 

int  i,x.y,w,h,xc.yc,xo,yo; 

FILE  ♦ipf; 

char  cmd [140] , ecmd [140] , image [30] ; 
ipf  =  fopen("autocor,dat","r") ; 
for  (i=0;  i<180;  ++i){ 

fscanf (ipf ,"*4s7,d%d7,d*/.d*/,d*/,dJidXd",  image,  Jkx,  kj,  kv,  Jfch, 
kxc,  kyc,  kyo,  kxo); 
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sprintf (cmd,  "nev.correlate  %8  T%s",  image,  image); 
printf ("%d  %s\n",i,  cmd); 
systemCcmd) ; 

} 

} 

D.15  localnorm.c 

/*  reads  in  truth  data  from  autocor.dat 

*  opens  filename  stored  in  image  for  reading 

*  reads  in  pixel  values 

*  extracts  target  from  image  using  truth  data, 

*  stores  as  template 

*  energy  normalizes  the  image  and  template. 

*  performs  point  by  point  correlation 

* 

*t 

iinclude  <stdio.h> 
finclude  <math.h> 


mainO 

{ 

int  i,j,k,x,y,w,h,xc,yc,xo,yo,l,m,a,b, count; 

FILE  ♦ipfl,*ipf2,*opf ; 
char  image [60] , si [100] ; 

float  orig.file[128] [128] ,val[128] [128] .vindov[128] [128] ; 
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double  evindov[128] [128] ,sum.etemp[i28] [128] , cor [128]  [128] ; 

I*  open  the  file  containing  the  truthing  data  */ 
ipfl  *  fopen("autocor.dat'',''r”); 
for  (k=0;  k<180;  •♦’+k)-[  /*  begin  k  counter  */ 

/*  read  in  truth  data  for  each  file  */ 

fscanf(ipfl/'y.sy.d*/.dy.dy.d*/.dy.dy.d%d''.  image,  kx,  kj,  k».  kh.  kxc, 
kjc,  kxo,  kyo); 

/*  use  truth  data  to  read  in  each  image  */ 

if  ((ipf2  *  fopen(image,''r''))  NULL){ 
printfC  I  can’t  open  ipf2  %s''. image); 
exit(-l); 

} 

printf ("reading  in  Xd  Xs\n",k, image); 
for  (i=0;  i<128;  ++i){ 
for  (j=0;  j<128;  ++j){ 

f scanf (ipf2, "Xf ” ,torig_f ile[i] [j] ) ; 


} 

} 

/*  extract  target  from  image  using  truth  data  and 
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find  the  sum  of  the  squared  pixel  values  */ 

1  =  0; 
sum  =  0.0; 
count  =  0; 

for  (i=y;  i<y+h;  ++i){ 
m  =  0; 

for  (j=x;  j<x+w;  ++j){ 
count  =  count  +  i; 
val[l]  [m]  =  orig_file[i]  [j]  ; 
sum  =  sum  double )( val  [1]  [m]*val[l]  [m]); 

m  =  m  +  1; 

} 

1  =  1+1; 

} 

sum  =  sqrt(sum); 

/*  energy  normalize  the  template  +/ 

for  (i=0;  i<h;  ++i){ 
for  (j=0;  j<w;  ++j)-( 

etemp[i][j]  =  val[i]  [j]/sum; 

} 

} 
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I*  perform  local  normalization  for  all  positions 
in  the  image  of  same  size  as  the  template 

for  (i=0;  i<128;  ++i){  /*  over  all  possible  ♦/ 

for  (j=0;  j<128;  ++j){  /♦  positions  in  the  image  ♦/ 
1  =  0; 
sum  =  0.0; 


I*  index  on  size  *^1 
/*  of  template  ♦/ 

for  (a=i-(h/2);  a<i+((h-*-l)/2) ;  ++a){ 
m  =  0; 

for  (b=j-(w/2);  b<j+((w+l)/2) ;  +-*-b)"( 

window [l][m]  =  orig.file [BOUND (a)] [BOUND (b)] ; 
sum  =  sum  *  (double) (window [1] [m]*window[l] [m]); 
m  =  m  +  1; 

} 

1=1+1; 

} 

sum  =  sqrt(sum); 

/*  energy  normalize  window  */ 

/+  and  store  in  ewindow  */ 

for  (a=6;  a<h;  ++a){ 
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for  (b=0;  b<w;  ++b){ 


ewindov [a] [b]  =  window [a] [b] /sum; 

} 

} 

sum  =  0.0; 

/♦  point  by  point  multiply  ♦/ 

I*  ewindow  and  etemp  and  sum  */ 
for  (a=0;  a<h;  ++a)< 
for  (b«0;  b<w;  -M-bX 

sum  =  sum  (ewindow  [a]  [b]  *  etemp  [a]  [b] ) ; 

} 

} 

I*  store  correlation  value  in  cor  array  *! 
cor[i]  [j]  =  sum; 


} 


} 


/«  prepare  si  to  have  name  of  output  image 
and  write  cor  uray  to  it  */ 
sl[0]  *  »\0»; 
strcat(8l,"cor") ; 
strcat (si, image) ; 

printf ("%s\n\n" ,sl) ; 
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if  ((opf  =  f open (si, "w"))  ==  NULL){ 
print  f  ("I  can’t  open  opf); 
exit(-l); 

} 

for  (i*0;  i<128:  ++i)-( 
for  (j=0;  j<128;  ++j){ 

f printf  (opf ,  "7.f  \n'' ,  cor  [i]  [j]  ) ; 

} 

} 

fclose(sl) ; 
fclose(ipf2) : 

}  /♦  end  of  k  counter  ♦/ 

}  /♦  end  of  main  ♦/ 
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Appendix  E.  Image  Truth  Data 

The  following  truthing  data  was  stored  in  the  file  autocor.dat.  The  format  of 
this  file  is  as  follows: 

image  z  y  v  h  xc  yc  xo  yo 

image  is  the  name  of  the  ascii  file  that  holds  the  image  data 

X  is  the  upper  left  x  coordinate  used  with  the  KHOROS  glyph  vextract 

y  is  the  upper  left  y  coordinate  used  vith  the  KHOROS  glyph  vextract 

V  is  the  width  in  pixels  used  vith  the  KHOROS  glyph  vextract 
h  is  the  height  in  pixels  used  with  the  KHOROS  glyph  vextract 

xc  is  the  X  coordinate  of  the  center  pixel  of  the  target  (calculated) 

yc  is  the  y  coordinate  of  the  center  pixel  of  the  target  (calculated) 

xo  is  the  offset  of  column  used  with  KHOROS  glyph  vpad 

yo  is  the  offset  of  row  used  with  the  KHOROS  glyph  vpad 

Nl_e0n2vl28s2.asc  50  47  23  24  61  59  52  52 

Nl_e0n3vl28s2.asc  50  48  24  24  62  60  52  52 

Nl.e0n5bvl28s2.asc  53  49  24  24  65  61  52  52 
Nl.e0n5evl28s2.asc  48  59  23  23  59  70  52  52 
Ml.e0n5fvl28s2.asc  62  64  22  21  73  74  53  53 
Nl_e0n5vl28s2.asc  48  46  26  24  61  58  51  52 

Ml.e0n6vl28s2.asc  49  47  25  24  61  59  51  52 

Mi_e0n7vl28s2.asc  49  48  25  23  61  59  51  52 
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Ml.e0n8vl28s2.asc  48  48  26  23  61  59  51  52 
Nl.e0n9vl28s2.asc  46  47  28  24  60  59  50  52 
Ml_e0p0cvl28s2.asc  57  48  25  23  69  59  51  52 

Nl.e0p0dHl28s2.asc  47  52  25  24  59  64  51  52 

Nl.e0p0evl28s2.asc  47  57  25  23  59  68  51  52 

Nl_e0p0fvl28s2.asc  63  65  19  20  72  75  54  54 

Ml_e0pl00s2vl28.asc  44  50  56  23  72  61  36  52 

Nl_e0pl05s2vl28.asc  43  50  56  23  71  61  36  52 

Ml_e0pl0bs2Hl28.asc  38  55  25  23  50  66  51  52 

Nl.e0pl0cs2vl28.asc  41  55  28  25  55  67  50  51 

Ml_e0pl0fs2vl28.asc  43  58  24  22  55  69  52  53 

Nl_e0pll0s2Hl28.asc  44  49  56  22  72  60  36  53 

Ml.e0pll5s2vl28.asc  46  50  55  22  73  61  36  53 

Ml.e0pl5s2vl28.asc  40  46  32  28  56  60  48  50 
Ml_e0plvl2882.asc  50  46  23  25  61  58  52  51 
Nl.e0p20s2vl28.asc  40  48  34  27  57  61  47  50 

Ml.e0p25s2Hl28.asc  40  48  37  27  58  61  45  50 

Nl.e0p2Hl28s2.asc  50  48  23  23  61  59  52  52 
Nl.e0p30s2Hl28.asc  40  48  39  27  59  61  44  50 

Ml.e0p35s2Hl28.asc  41  48  42  27  62  61  43  SO 

Nl.e0p3Hl28s2.asc  50  48  24  24  62  60  52  52 
Nl.e0p40bs2Hl28.asc  31  57  41  23  51  68  43  52 

Nl.e0p40fs2Hl28.asc  40  60  35  20  57  70  46  54 

Nl.e0p40s2Hl28.asc  41  48  43  26  62  52  42  51 
Nl.e0p45ds2Hl28.asc  35  55  43  25  56  67  42  51 

Nl.e0p45es2Hl28.asc  39  63  39  25  58  75  44  51 

Nl.e0p4Sfs2Hl28.asc  40  59  35  22  57  70  46  53 

Nl.e0p45s2Hl28.asc  42  48  45  27  64  61  41  50 
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Ml_e0p4vl28s2.asc  49  47  25  25  61  59  51  51 
Nl.e0p50b82vl28.asc  31  58  47  25  54  70  40  51 

Ml_e0p50c82vl28.asc  33  56  47  25  56  68  40  51 

Ml.e0p50fs2vl28.asc  40  60  38  21  59  70  45  53 

Nl.e0p50s2vl28.asc  42  49  49  27  66  62  39  50 

Nl.e0p5582vl28.a8C  43  50  49  25  67  62  39  51 

Nl.e0p5bvl2882.asc  26  47  26  24  39  59  51  52 

Nl.e0p5fvl28s2.asc  62  64  21  21  72  74  53  53 

Nl.e0p5vl2882.a8C  50  47  25  25  62  59  51  51 
Nl.e0p60s2vl28.asc  44  48  52  27  70  61  38  50 

Nl.e0p6582vl28.asc  45  48  53  27  71  61  37  50 

Nl.e0p6vl28s2.asc  49  47  27  25  62  59  50  51 
Nl.e0p7082vl28.asc  45  48  55  25  72  60  36  51 

Nl.e0p75s2vl28.asc  47  49  56  25  75  61  36  51 

Nl.e0p7vl2882.asc  48  47  28  25  62  59  50  51 
Nl.e0p80e82vl28.a8C  45  62  48  22  69  73  40  53 

Nl.e0p80f82vl28.a8C  41  60  45  20  63  70  41  54 

Nl.e0p85(ls2vl28.a8C  28  59  58  22  57  70  35  53 

Nl.e0p85es2vl28.asc  44  60  47  23  67  71  40  52 

Nl.e0p85fs2vl28.asc  42  60  45  20  64  70  41  54 

Nl.e0p85s2vl28.a8C  49  49  57  24  77  61  35  52 
Nl.e0p8vl2882.asc  49  45  26  26  62  58  51  51 
Nl.e0p90bs2vl28.asc  32  52  59  22  61  63  34  53 

Nl.e0p90fs2vl28.asc  42  60  47  19  65  69  40  54 

Nl.e0p9082vl28.asc  50  49  58  25  79  61  35  51 

Nl.e0p95s2vl28.asc  41  48  58  26  70  61  35  51 

Nl.e0p9vl28s2.asc  49  45  28  27  63  58  50  50 
Nl.eln5bvl2882.a8C  78  80  28  28  92  94  50  50 
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Appendix  F.  KHOROS 


KHOROS  is  an  image  processing  package  which  is  basically  a  set  of  approxi¬ 
mately  260  routines.  Cantata  is  the  visual  programming  environment  for  the  KHOROS 
system.  To  execute  cantata,  simply  enter  “cantata”  on  the  command  line  in  the  cmd- 
tool  on  a  SUN  workstation.  It  will  take  about  one  minute  for  cantata  to  completely 
load  and  display  the  masterform  workspace  on  the  screen. 

F.l  Using  KHOROS 

Designing  a  sequence  of  KHOROS  commands  can  be  done  in  two  ways.  First, 
if  the  exact  command  is  known,  it  can  be  located  using  the  “ROUTINES”  box  on  the 
upper  left  side  of  the  masterform.  Clicking  on  “ROUTINES”  using  the  left  mouse 
button  will  bring  up  a  list  of  all  available  routines.  Use  the  left  mouse  button  to  click 
on  all  routines  needed.  Once  this  is  done,  select  “Use”  using  the  left  mouse  button. 
The  list  will  disappear  and  faint  outlines  of  the  routine(s)  selected  will  appear  under 
the  cursor.  Place  the  cursor  in  the  masterform  where  you  want  the  glyphs  and  click 
the  left  mouse  button.  The  glyphs  (icons)  will  appear  on  the  masterform. 

The  second  way  to  choose  glyphs  is  to  use  the  pull-down  menus  located  at  the 
top  of  the  masterform.  Select  the  correct  menu  by  holding  down  the  right  mouse 
button  and  pulling  the  menu  down.  When  the  correct  routine  is  highlighted,  release 
the  right  mouse  button.  The  faint  outline  of  the  selected  routine  will  appear  under 
the  cursor.  Place  the  cursor  in  the  masterform  where  you  want  the  glyphs  and  click 
the  left  mouse  button. 

Any  glyph  selected  that  is  no  longer  needed  can  be  deleted  by  selecting  the 
“bomb”  image  in  the  upper  left  comer  of  the  gl}q>h  using  the  left  mouse  button.  The 
center  blue  image  next  to  the  “bomb”  image  is  used  to  change  default  values  for  the 
glyph.  The  upper  right  image  on  the  glyph  resembles  a  light  switch.  Clicking  this 
with  the  left  mouse  button  will  execute  that  gl3l>h. 
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The  glyphs  also  have  boxes  along  their  left  and  right  edges,  each  with  an  arrow 
that  points  to  the  right.  The  arrows  on  the  left  edge  are  “input”  arrows  and  the 
arrows  on  the  right  edge  are  output  arrows.  To  connect  the  output  of  one  routine 
to  the  input  of  another,  simply  click  on  the  output  arrow  of  the  first  routine  and 
then  click  on  the  input  arrow  of  the  second  routine  using  the  left  mouse  button. 
The  connection  will  now  show  on  the  screen.  To  delete  a  connection,  place  the 
cursor  on  the  it  and  click  the  left  mouse  button.  This  will  bring  box  which  has 
“Delete  Connection”  on  the  upper  left.  Clicking  on  this  with  the  left  mouse  button 
will  delete  the  connection.  However,  sometimes  the  connection  will  not  be  erased 
from  the  screen.  This  can  be  rectified  using  the  “REDRAW”  command  as  discussed 
below. 

Once  the  appropriate  routines  are  on  the  masterform,  they  can  be  executed 
using  the  “RUN”  box  at  the  upper  left  of  the  master  form.  Each  routine  will  be 
executed  in  order  and  the  output  of  one  routine  will  automatically  be  fed  to  the 
input  of  the  next  routine  following  the  routes  you  have  designated.  K  any  routine 
does  not  execute  correctly,  a  blinking,  red,  unhappy  face  will  appear  on  the  glyph. 
This  can  be  selected  to  view  an  appropriate  error  message. 

Other  boxes  located  on  the  upper  left  of  the  master  form  include  “REDRAW” 
which  will  clean  up  the  masterform  and  erase  any  connections  or  glyphs  that  have 
been  deleted.  “RESET”  will  reset  all  glyphs  if  any  changes  have  been  made  to  them. 
“HELP”  will  give  some  information  about  KHOROS  and  on-line  documentation 
available.  “QUIT”  will  quit  KHOROS. 

F.2  Correlation  using  KHOROS 

To  generate  a  basic  correlation  sequence,  you  will  need  perform  the  following 
routines.  Each  routine  is  discussed  along  with  where  I  located  it 
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F.2.1  asc2viff  Asc2viff  converts  the  raw  data  (ascii)  to  the  VIFF  format 
needed.  It  is  found  under  “CONVERSIONS”  and  “Raw  File  Format’. 

You  can  designate  the  path  where  your  ascii  file  resides,  or  else  click  on  “Input 
ASCII  file”  using  the  left  mouse  button.  This  will  display  your  home  directory. 
The  path  to  the  correct  input  file  can  be  designated  by  sequentially  selecting  the 
appropriate  directories/subdirectories  using  the  left  mouse  button.  Other  important 
boxes  are  as  follows.  Make  sure  that  the  “ASCII  to  VIFF”  box  on  the  upper  left  is 
highlighted  in  black.  Also,  the  “data  type”  is  “REAL” .  Finally,  fill  in  the  number  of 
rows  and  columns  with  the  size  of  your  image.  Once  all  is  correctly  filled  in,  select 
“Glyph”  in  the  upper  left  corner. 

F.2.2  vextract  Vextract  will  extract  a  set  of  pixels.  It  is  found  under  “IM¬ 
AGE  PROCESSING”  and  “Subregion”,  and  “Extract  Subimage”.  To  designate  the 
correct  set  of  pixels,  input  the  correct  “Upper  left  X  coord.” ,  “Upper  left  Y  coord.” , 
“Width  in  Pixels”,  and  “Height  in  Pixels”.  Select  “Glyph”. 

F.2.3  vpcui  Vpad  will  take  the  extracted  image  and  pad  around  it  with  a 
designated  pixel  value.  It  is  found  along  with  vextract,  except  you  select  “Pad 
Image”  instead  of  “Extract  Subimage”.  Designate  the  size  of  the  padded  image  in 
“Row  size”  and  “Column  size” .  You  can  control  where  the  extracted  image  is  placed 
in  the  padded  image  by  designating  the  “Offset  of  row”  and  “offset  of  column”. 
Finally,  designate  the  Real  and  Imaginary  “Pad  Value”.  For  my  purposes,  both 
“Pad  Values”  were  0. 

F.2.4  vfft  Vfft  will  find  the  fast  Fourier  transform  of  an  image.  It  is  found 
under  “IMAGE  PROCESSING”,  “'D-ansforms” ,  and  “FFT”.  Specify  the  FFT  direc¬ 
tion  as  “Forward”  or  “Inverse”  by  using  the  left  mouse  button  on  the  “Forward”  or 
“Inverse”  box,  i.e.  this  box  toggles. 
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F.2.5  vconj  Vconj  will  find  the  complex  conjugate  of  an  image.  It  is  found 
under  “ARITHMETIC”,  “Unary  Arithmetic”,  and  “Conjugate”. 

F.2.6  vmul  Vmul  will  multiply  two  images  pixel  by  pixel.  It  is  found  under 
“ARITHMETIC”,  “Binary  Arithmetic” ,  and  “Multiplication”. 

F.2. 7  vtranslat  Vtranslat  will  translate  an  image.  It  is  found  under  “IMAGE 
PROCESSING”,  “Geometric  Manip.”,  and  “Translate  Image”.  Make  sure  that  the 
“X.lVanslation”  and  “Y.Translation”  are  exactly  half  of  the  width  and  height,  Re¬ 
spectively,  of  the  size  of  the  images  you  are  correlating.  This  relocated  the  position 
of  the  DC  component  found  by  the  Fourier  transformations. 

F.2.8  vconvert  Vconvert  will  perform  Viff  file  to  other  files.  It  is  found  under 
“CONVERSIONS”  and  “Data  Conversions”,  and  is  used  to  convert  Viff  to  byte 
formats  for  collecting  statistics. 

F.2.9  vpostscr  Vpostscr  will  Print  a  hardcopy  of  an  image  when  used  in 
conjuction  with  Vconvert.  It  is  found  under  “OUTPUT”  and  “Print  Image”.  It 
accepts  byte  file  format  from  the  vconvert  routine  and  sends  to  the  designated  printer. 

F.2.10  put-update  Put.update  will  display  an  image  to  the  screen.  It  is  found 
under  “OUTPUT”  and  “Display  Images”. 

F.2.11  vstats  Vstats  will  compute  statistics  on  a  Viff  file.  It  can  be  found 
under  “ROUTINES” .  Feeding  this  to  a  xwiewer  will  show  a  two  dimension  plot  of 
the  correlation  values. 

F.2.12  xwiewer  Xwiewer  will  display  to  the  screen  a  file.  When  used  in 
conjunction  with  vstats  is  will  list  various  statistics  for  you  to  read. 
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F.2.13  xprismS  XprismS  will  display  a  plot  to  the  screen.  When  used  in 
conjunction  with  vstats  it  will  display  a  three  dimension  plot  of  the  correlation. 

F.3  Invoking  KHOROS  using  C  and  C  Shells 

While  the  KHOROS  Cantata  command  form  is  a  very  useful  tool,  it  is  very 
time  consuming  to  use.  One  way  around  this  is  to  invoke  KHOROS  from  within 
a  C  program.  Since  each  KHOROS  routine  is  itself  executable  via  a  system  level 
command,  this  is  easily  done.  The  following  is  a  step  by  step  procedure  to  do  this: 

•  Build  the  KHOROS  network  using  Cantata.  Determine  which  routines  are 
necessary  and  in  what  order.  Also  note  which  parameters  in  the  KHOROS 
routines  are  image  dependent,  that  is,  which  paramaters  change  as  the  image 
changes.  These  will  be  passed  as  program  parameters  in  the  C  program. 

•  Debug  the  network.  This  is  self  explanatory.  The  network  must  work  as  you 
intend  it  to  work. 

•  Reset  the  Cantata  masterform.  This  will  ensure  all  routines  are  executed  one 
last  time  for  the  “capture” . 

•  Capture  the  KHOROS  execution.  Once  the  network  has  successfully  executed, 
quit  the  KHOROS  masterform.  In  the  command  tool  that  was  used  to  invoke 
KHOROS,  the  execution  of  the  network  is  documented.  Simply  highlight  the 
vai-ious  routine  executions  with  the  left  mouse  button,  hit  “copy”  on  the  left 
side  of  the  keyboard,  and  place  in  a  new  file  by  hitting  “paste” .  These  will  be 
made  into  a  C  Shell. 

•  Build  the  C  Shell.  In  the  file  that  contains  the  KHOROS  execution  capture, 
place  the  following  as  the  first  line 

#!/bin/csh 
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.  Next,  determine  which  variables  will  be  passed  to  the  C  Shell  and  in  which 
order.  For  example,  if  the  C  Shell  is  called  correlate  and  the  image  filename,  x, 
y,  w,  and  h  values  need  to  be  passed,  then  the  shell  can  be  called  by  correlate 
filename  xywh  with  the  appropriate  values  in  the  corresponding  place.  Inside 
the  shell,  these  \’alucs  can  be  referenced  by  $1,  $2,  $3,  $4,  and  $5  respectively. 
Determine  which  of  the  KHOROS  routines  have  variable  parameters  and  re¬ 
place  the  numeric  values  in  the  capture  with  the  $  variable  name.  Or  symbolic 
names  can  be  used  by  using  the  “set”  command.  For  example,  “set  infile  = 
$1”  will  allow  the  input  image  file  to  be  referenced  by  “$infile”  for  ease  of  use. 
If  the  output  file  is  to  have  the  same  root  name  as  the  input  image,  but  with 
a  different  extension,  then  use  a  two  line  sequen 

•  Make  the  C  Shell  executable.  Type  chmod  766  correlate  for  this  example. 

•  Build  the  input  data  file.  The  data  file  is  simply  a  laundry  list  of  input  file 
name  and  all  necessary  variable  parameters  to  be  used  by  the  C  Shell. 

•  Build  the  C  program.  The  C  program  will  open  the  input  data  file  for  reading 
via  the  fscanf  function.  Next,  a  “for”  loop  can  be  used  to  have  the  program 
execute  for  the  correct  number  of  files  listed  in  the  input  data  file.  A  fscanf 
function  is  used  to  read  in  the  image  to  be  processed,  along  with  the  associated 
variable  parameters.  The  actual  command  to  invoke  the  C  Shell  with  the 
correct  parameters  must  now  be  placed  into  a  string.  This  is  done  using  a 
sprintf  command,  such  as  the  following  example:  “sprintf(cmd,  ’’correlate  %s 
%d  %d  %d  %d”,  image,  x,y,w,h);”  will  place  the  call  to  the  C  Shell  in  the 
string  “cmd”.  This  can  now  be  invoked  by  the  line  “system(cmd);”. 

•  Compile  the  C  program.  Self  explanatory. 

•  Run  the  C  code.  Self  explanatory. 

An  example  of  this  can  be  found  in  Appendices  D.5  and  D.6  under  C  programs 
and  C  shells.  Correlate  is  the  C  Shell  that  contains  the  KHOROS  capture.  The  C 
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program  docorrelate.c  reads  in  data  file  autocor.dat,  and  passes  these  parameters  to 
the  correlate  C  Shell  180  times.  The  output  was  180  files  with  the  .dat  extension. 
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Appendix  G.  ANVIL 


ANVIL,  the  Artificial  Neural  Vision  Learning  System,  was  developed  by  Booz- 
Allen  &  Hamilton,  Inc.  of  Arlington,  Virginia  under  Contract  Number  F33615-89-C- 
1013  for  Wright  Laboratories  AAAT-1  at  Wright-Patterson  AFB,  OH.  ANVIL  is  an 
autonomous  target  recognition  (ATR)  software  package  that  takes  a  training  set  of 
HIPS  formatted  terrain  board  imagery,  learns  features  for  targets  taken  from  these 
images,  and  develops  a  neural  network  to  locate  these  targets.  The  description  of 
how  ANVIL  works  follows. 

G.l  How  ANVIL  Works 

ANVIL  can  conceptually  be  broken  up  into  three  stages  (2): 

•  Training  -  finding  features  in  a  training  set  that  can  be  used  to  distinguish 
between  training  images 

•  Associative  Mapping  -  associating  the  feature  positions  for  each  training  image 
with  the  appropriate  image 

•  Performance  -  looks  for  the  same  set  of  feature  in  the  same  geometry  in  the 
test  image 

A  discussion  of  each  stage  follows. 

G.1.1  Training  ANVIL  uses  a  type  of  template  matching  approach  for  the 
ATR  process.  Each  training  image  is  segmented  by  hand  to  identify  the  target  from 
the  background.  Once  an  image  is  segmented,  ANVIL  finds  a  set  of  templates,  called 
features,  that  are  11  x  11  pixels  in  dimension.  The  following  discussion  describes  how 
the  features  are  found. 

The  first  training  image  has  the  segmented  target  extracted  from  the  image 
and  padded  with  low  intensity  values  to  make  up  the  original  128  x  128  image  size. 
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This  image  is  then  compared  with  an  11  x  11  pixel  mask  called  the  novel  mask.  This 
mask  has  a  high  intensity  pixel  in  its  center  and  then  each  concentric  square  around 
this  has  decreasing  intensity  -  a  type  of  gaussian  mask.  The  mask  is  applied  to 
every  possible  place  in  the  128  x  128  training  image  and  the  similarity  metric  MSE 
calculated.  This  results  in  a  118  x  118  matrix  of  MSE  calculations  called  a  novel 
Cluster.  If  any  11  x  11  set  of  pixels  in  the  input  image  has  all  pixel  values  less  than 
20,  the  corresponding  error  value  in  the  novel  cluster  is  forced  to  the  maximum  error 
value  of  1.0.  This  step,  called  Information  Content  Gating  is  designed  to  remove  low 
intensity  areas  with  little  or  no  information  content.  Next,  any  pixel  in  the  118  x 
118  novel  cluster  that  is  above  the  information  threshold  will  be  replaced  with  the 
maximum  error  value  of  1.0.  Now,  we  find  the  minimum  MSE  in  the  118  x  118  novel 
cluster.  Once  the  minimum  error  term  is  found,  we  find  the  corresponding  11  x  11 
feature  in  the  input  image.  This  becomes  the  first  learned  feature. 

This  learned  feature  now  becomes  an  influence  into  how  the  novel  mask  learns 
the  next  feature,  whether  this  new  feature  comes  from  the  same  input  image  or  from 
a  successive  input  images. 

The  second  input  image  is  presented  for  trmning.  We  now  have  an  11  x  11 
training  feature  as  well  as  the  11x11  novel  mask.  Again,  the  input  image  is  presented 
to  the  novel  n.osk  and  a  118  x  118  novel  cluster  results.  Again,  any  11x11  region 
in  the  input  image  that  has  all  121  pixels  below  the  threshold  value  causes  the 
corresponding  value  in  the  novel  mask  118  x  118  error  matrix  to  be  set  to  1.0. 

The  second  input  image  is  also  presented  to  the  first  11  x  11  learned  feature 
which  also  results  in  an  118  x  118  cluster  called  a  feature  extraction  cluster  (FE 
cluster).  Since  we  have  only  the  one  learned  feature  at  this  time,  this  FE  cluster  has 
no  other  FE  cluster  to  compare  with,  so  it  is  directly  gated  to  the  Learned  Cluster 
Column  Exclusivity.  If  any  error  term  in  the  FE  cluster  is  below  a  threshold  value, 
called  the  separation  threshold.,  the  corresponding  term  in  the  novel  cluster,  not  the 
FE  cluster,  is  set  to  maximum  error  of  I.O.  The  FE  cluster  is  now  sent  through  Fea- 
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ture  Vigilance  Learned  Gating.  Any  error  term  above  the  feature  vigilance  threshold 
in  the  FE  cluster  will  be  forced  to  the  maximum  error  of  1.0.  The  minimum  error 
in  the  FE  cluster  is  found  and  the  values  within  the  exclusivity  region  surrounding 
this  minimum  error  are  forced  to  the  maximum  error  of  1.0.  This  process  is  repeated 
until  there  is  only  one  error  value  in  the  FE  cluster  for  each  exclusivity  region  not 
set  to  the  maximum  error  value.  The  surviving  FE  cluster  error  values  force  the 
corresponding  novel  cluster  error  values  to  the  maximum  error  value.  That  is,  if 
there  exists  a  low  error  value  in  the  FE  cluster,  then  this  feature  must  already  be 
represented  and  needs  to  be  excluded  from  being  learned  again.  The  novel  cluster 
is  searched  for  the  minimum  error  valur  Once  this  is  found,  the  corresponding  11  x 
11  pixel  are  in  the  input  image  is  extracted  and  becomes  the  new  learned  feature. 

The  processing  now  begins  anew  with  either  a  new  training  image,  or  by  cycling 
through  the  training  images  again.  Ag;.'  .n,  the  training  image  is  compared  with  the 
novel  mask,  but  it  is  also  compared  with  the  two  learned  features.  This  results  in  two 
FE  clusters.  These  two  FE  clusters  are  compared  on  a  “pixel-by-  pixel”  basis  and 
the  lowest  error  for  each  “pixel”  is  maintained  in  an  overall  combined  learned  cluster. 
This  process  is  called  the  Learned  Cluster  Column  Exclusivity  and  is  performed  when 
there  are  two  or  more  learned  features. 

The  processing  as  mentioned  above  is  continued  until  the  novel  mask  is  com¬ 
posed  of  only  maximum  error  values.  At  this  time,  feature  learning  is  completed  and 
the  next  step  is  begun. 

G.1.2  Associative  Mapping  Once  all  of  the  training  images  are  segmented 
and  the  features  are  learned,  each  image  is  processed  again.  This  stage  records  the 
various  features  and  their  relative  positions  in  each  training  image.  The  locations  of 
the  features  with  respect  to  the  center  of  the  segmented  target  are  used  to  set  up  the 
Spatial  Location  Layer  masks.  The  learned  features  are  also  combined  into  a  content 
addressable  memory  (CAM),  which  is  the  region  around  a  training  image  where  it  is 
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chosen  as  the  best  above  other  training  images.  From  the  features  and  their  relative 
positions,  ANVIL  then  builds  a  neural  network  architecture  based  on  this  to  find 
targets  in  the  test  images.  While  ANVIL  is  neither  perspective  or  scale  invariant, 
it  is  robust  to  scale  and  perspective  changes.  Booz-Allen  &  Hamilton  claims  the 
amount  of  robustness  to  be  ±  10  degrees  azimuth,  ±  5  degrees  elevation  and  ±  25 
percent  in  scale. 

G.1.3  Performance  Test  images  are  now  fed  to  the  newly  learned  neural 
network  architectur  .  In  the  feature  extraction  layer,  ANVIL  compares  each  training 
feature  with  every  possible  position  in  the  input  image  and  computes  a  similarity 
metric  which  measures  how  closely  the  feature  matches  that  image  at  that  position. 
These  similarity  metrics  are  then  combined  into  an  array  of  outputs,  called  clusters, 
for  each  learned  feature. 

The  spatial  location  layer  looks  for  the  best  feature  match  within  the  corre¬ 
sponding  mask  and  computes  a  similarity  metric.  Again,  this  is  performed  for  all 
possible  positions,  and  results  in  arrays  that  record  the  positions  of  the  best  feature 
matches  relative  to  the  spatial  locations  mask’s  center. 

Finally,  in  the  object  detection  layer,  feature  and  position  information  are  com¬ 
bined  into  an  overall  target  similarity  metric,  resulting  in  a  similarity  measure  for 
every  training  training  target  at  every  possible  target  location.  This  similarity  mea¬ 
sure  is  the  overall  error  between  the  ima^e  and  the  training  target  at  that  location. 
The  lowest  values  become  candidates  for  target  detections.  However,  a  threshold 
value  is  also  used  to  eliminate  those  with  higher  values. 

G.2  ANVIL  System  Requirements 

According  to  the  ANVIL  Software  User’s  Manual,  the  ANVIL  system  runs 
ONLY  on  the  following  list  of  hardware: 

•  Sun’s  Operating  System  (SunOS)  4.0.3  or  higher 


99 


•  Color  Sun-4  and  SPARC  workstations  with  a  three  button  mouse 

•  Both  8  bit  and  24  bit  color  (CG9)  is  supported 

•  Inmos  transputers 

•  20  MB  of  disk  space  for  the  source  code 
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